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ABSTRACT 


An  essentially  self-contained  elementary  account,  from  a  unified 
variational  point  of  view,  is  given  of  the  theory  of  self-adjoint  eiger value 
problems  with  discrete  spectra,  governed  by  linear  differential  equations 
of  the  form  M(y)  =  X  N(y).  The  theory  is  directly  relevant  for  the  various 
types  of  appr  rimate  energy  methods  applied  such  problems.  Included 
herein  are  statements  and  proofs  of  the  variational,  minimum,  and  maxi¬ 
mum-minimum  characterization  of  the  eigenvalues  in  all  modes.  Theorems 
based  on  both  the  Rayleigh  quotient  and  the  energy  quotient,  including  the 
role  of  natural  boundary  conditions,  are  developed.  In  addition,  existence 
proofs,  and  discussion  and  proofs  of  completeness  in  both  the  N-norm  and 
M-norm  are  given. 
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I.  INTRODUCTION 

1.  This  is  the  first  of  two  reports  in  which  the  purpose  is  to  present,  in  ns 
self-contained  a  manner  as  feasible,  ;he  theory,  with  applications,  of 
energy  methods  in  linear  self-adjoint  eigenvalue  problems  with  a  discrete 
spectrum.  The  present  report  deals  with  the  theory  of  the  eigenvalue  spect¬ 
rum  in  such  systems,  and  the  second  report  will  deal  with  the  theory  and 
applications  of  the  Ritz-Galerkin  and  related  energy  methods.  Attention 
will  be  focused  on  problems  characterized  by  ordinary  differential  equations 
of  the  form  M(y)  =  X  N(y),  where  M  and  N  are  real  linear  differential  opera¬ 
tors  to  be  defined  in  further  detail  subsequently,  y  =  y(x),  x  is  real,  and  X 
is  an  eigenvalue.  M  and  N,  and  the  associated  (homogeneous)  boundary  con¬ 
ditions,  will  be  assumed  such  that  the  problem  is  self-adjoint  and  with  a 
discrete  eigenvalue  spectrum.  Such  problems,  despite  their  apparently 
restricted  nature,  still  include  a  considerable  variety  of  specific  physical 
and  engineering  applications,  as  exemplified  in  the  book  of  Collatz1.  The 
prototype  of  such  problems  may  be  considered  to  be  the  free  vibrations  of 
a  Bernoulli-Euler  beam,  or  the  buckling  of  a  column.  The  present  report 
includes  the  following  items:  (1)  Variational,  minimum,  and  maximum- 
minimum  characterizations  of  the  eigenvalues,  and  relevant  implications 
of  these  characterizations.  (2)  Use  in  (1)  of  the  familiar  technique  of 
Lagrange  multipliers  to  establish  the  recursive  minimum  properties  of 
the  eigenvalues.  (3)  Theorems  relating  to  the  energy,  in  addition  to  the 
Rayleigh,  quotient,  and  the  role  of  natural  boundary  conditions  in  the  general 
case.  (4)  Proof,  by  the  use  of  minimizing  sequences,  of  the  existence  of 
admissible  minimizing  functions  for  the  general  eigenvalue  problem  M(y)  = 
XN(y).  Such  a  proof  is  especially  relevant  for  Ritz  type  of  energy  approxi¬ 
mation  methods.  (5)  Completeness  and  closedness  theorems  in  both  the 

Preceding  page  blank 


N-norm  and  M-norm.  Self-contained  proofs  will  be  furnished  throughout. 

In  view  of  the  long  and  continual  use  of  energy  methods  in  many 
types  of  applications,  it  appears  worthwhile  to  present  the  theory  of  such 
methods  at  least  for  the  class  of  problems  considered  herein.  Moreover, 
the  unified  variational  type  of  approach  taken  here  appears  especially  de¬ 
sirable  for  this  purpose,  since  this  is  essentially  the  approach  on  which  the 
energy  methc  1  s  can  be  most  directly  based,  at  least  in  the  self-adjoint  sys¬ 
tems  with  discrete  eigenvalue  spectra  considered  here.  As  will  be  seen, 
the  variational  approach  can  be  made  quite  elementary  for  this  class  of 
problems,  in  the  sense  that  only  concepts  familiar  from  "advanced  calculus" 
will  be  needed.  Even  such  an  elementary  approach  affords  an  opportunity, 
as  will  be  seen,  to  touch,  albeit  slightly,  on  such  fields  of  classical  and 
modern  interest  as  real  variable  theory,  calculus  of  variations,  approxima¬ 
tion  theory  and  functional  analysis.  Despite  the  existing  rich  literature  on 
self-adjoint  eigenvalue  problems,  there  is  no  single  reference  which  contains 
an  elementary,  essentially  self-contained  account,  from  a  variational  point 
of  view,  of  all  of  the  items  listed  in  the  preceding  paragraph.  Moreover, 
in  order  to  keep  the  exposition  simple,  self-contained,  but  essentially 
complete,  the  authors  have  furnished  their  own  proofs  or  modified  existing 
ones  in  various  instances  in  items  (1)  -  (5). 

In  connection  with  the  available  pertinent  literature,  it  is  noted  that 
the  theory  of  linear  self-adjoint  eigenvalue  problems  has  been  treated  in 
the  following  four  ways:  (a)  Differential  equations; (b)  Integral  equations! 

(c)  Operators  in  a  Hilbert  space;  (d)  Variational  principles.  These  ap¬ 
proaches  are,  of  course,  not  mutually  exclusive,  and  indeed  the  difference 
in  the  approaches  is  sometimes  mainly  a  matter  of  relative  emphasis. 

For  example,  (b)  is  based  on  the  well-established  Hilbert-Schmidt  theory 
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of  homogeneous  Fredholm  integral  equations  with  a  symmetric  kernel,  while 
(a)  is  based  essentially  on  tht  inverse  operator  of  M,  which  in  turn  involves 
p  c-perties  of  the  associated  Green’s  function.  The  kernel  in  (b),  however, 
is  closely  related  to  the  Green's  function.  The  approach  in  (c)  is  an  abstract 
generalization  of  the  integral  operators  in  (b)  to  operators  with  a  certain  set 
or  general  properties.  Even  (d),  which  means  here  an  approach  based  on 
extremal  properties  of  the  Rayleigh  or  related  energy  quotients,  ic  often 
treated  in  conjunction  with  (a)  -  (<  ),  especially  in  connection  with  existence 
theorems.  The  differential  equation  approach  is  given  in  Refs.  1-3=  Ref.  1 
also  gives  an  elementary  theory  (i.  e. ,  without  Green's  functions,  etc. )  for 
the  special  case  of  a  second-order  differential  equation  with  the  boundary 
conditions  y(a)  =  y(b)  =  0,  and  indicates  its  extension  to  a  particular  fourth- 
order  differential  equation  with  certain  boundary  conditions.  Ref.  3,  in 
addition  to  the  differential  equation  approach,  gives  a  self-contained  varia¬ 
tional  approach  based  on  the  theory  of  M.  Morse;  however,  both  of  the 
approaches  given  in  Ref.  3  are  in  rather  abstract  terms  in  a  highly  general¬ 
ized  setting,  and  the  variational  approach  there  appears  rather  different 
from  that  to  be  given  here. 

In  approach  (b)  tho  differential  equation  is  transformed  into  an  integral 
equation.  If  the  differential  equation  M(y)  =  \N(y)  is  self-adjoint,  and  N(y) 
is  of  the  one-term  type  (aB  defined  in  Ref.  1;  e.g. ,  N(y)  =  p  (x)  y)  then  the 
differential  problem  can  be  made  equivalent  to  finding  the  eigenvalues  of 
a  homogeneous  Fredholm  integral  equation  with  a  symmetric  kernel.  An 
exposition  of  this  integral  equation  problem  is  gi'ren,  for  example,  in 
Refs.  4-6.  A  Hilbert  space  approach  is  shown  in,  e.g.,  Refs.  7-9.  The 
variational  theory  for  self-adjoint  eigenvalue  problems  is  given  in  Refs.  4 
and  10  for  Sturm- Li ouville  (second-order)  differential  equations;  variational 


aspects  for  Sturm- Liouville  equations  are  also  included  in  Refs.  11  and  12. 

For  the  more  general  system  M(y)  =  \N(y),  a  variational  approach  is  shown 
in  Refs.  8  and  13.  It  is  noteworthy  that  only  Refs,  10  and  13  give  a  proof  of 
existence  of  the  relevant  minimizing  functions  by  variational  means,  i.e, 
by  means  of  minimizing  sequences.  Thus,  there  appears  to  be  no  such  type  of 

g 

proof  available  in  the  English-language  literature.  Even  Mikhlin  ,  who  gives 
a  comparatively  extensive  account  of  variational  principles,  refers  to  his 

5 

Integral  Equations  text  for  a  proof  of  a  compactness  property  which  is  as¬ 
sumed  in  Ref.  8  for  the  purpose  of  an  existance  proof,  Collatz\  for  the 
general  case  M(y)  =  \N(y),  proves  the  existence  of  an  admissible  minimizing 
function  for  the  Rayleigh  quotient  by  means  of  Green's  functions. 

It  is  remarked,  finally,  that  the  type  of  problem  to  be  considered 
here  is  a  generalization  of  Sturm- Liouville  problems.  It  is  however,  still 
fairly  .'estricted,  For  example,  the  literature  (including  some  of  the  refer¬ 
ences  cited  above)  now  contains  energy  principles  and  methods  for  complex 
operators,  non-self-adjoint  and/or  nonlinear  problems,  and  partial  differ¬ 
ential  equations.  Nevertheless,  or  perhaps  even  because  of  this,  it  is 
worthwhile  to  present  in  a  unified  review  a  reasonably  complete,  yet  elementary 
variational  theory  of  energy  methods  of  determining  eigenvalues  at  least  for 
the  class  of  problems  considered  herein. 


-fe¬ 
ll  .  VARIATIONAL.  THF.CRY  FOR  THE  RAYLEIGH  QUOTIENT 

2.  Self-adjoint  eigenvalue  problem*.  As  in  Refs.  U  2,  8  and  13, 
the  problem  will  be  considered  in  which  it  is  desired  to  find  all  the  eigen¬ 
values  of  X  for  which  a  nan-trivial  solution  exists  to  the  linear  homogeneous 
differential  equation 

M{y)  =  \N(y)  (lj 

subject  to  the  boundary  conditions 

B(y)  =  0  (2) 

Here,  M(y)  and  N(y)  are  assumed  of  the  form 

m 

M(y)=  2  (-1)*' [  f  (x>  y(v,(x>  ](v> 

V-  0 

N(y)  =  y  C-l)v[gv(x)y(v)(x)}(v) 

v~0  * 

where  f  (x)  and  g  (x*  are  given  (real)  v-times  differentiable  functions  in 
w  v 

[a,  b  ].  It  is  assumed  that  m  >  n  >  0.  The  boundary  conditions  (2)  may 
be  supposed  as  2m  (independent',  conditions,  each  of  the  form 
2m -1 

y  (aw  y{v)(a)  +  pv  y(v,(b)  )  =  0 

v-="o 

where  [a,  bl  is  the  domain  of  the  independent  variable  x,  and  a  and  p  are 
1  v  y 

given  constants  which  will  be  here  assumed  independent  of  X.  The  domain 
[a,  b]  throughout  this  analysis  will  be  assumed  finite.  For  convenience, 
functions  which  satisfy  all  of  the  boundary  conditions  and  have  continuous 
derivatives  through  order  2m  in  [a,  b]  will  be  called  ■  comparison*  functions 
(a  term  used  by  Collatz).  It  may  be  seen  that  the  problem  formulated  here 
includes,  for  example,  free  bend  ng  and  torsional  vibrations,  and  buckling, 


of  Bernoulli- Euler  beams,  and  is,  in  fact,  a  generalization,  of  second-order 
Sturm*  Liouville  problems. 
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for  ail  comparison  functions  u  and  v.  For  given  M  and  N  in  the  form  of 
Eqs.  (3),  it  will  be  found  by  successive  integrations  by  parts  that  the  self- 
adjointness  of  the  system  will  depend  only  on  the  boundary  conditions 
(cf.  Eq.  (3 4d)  below),  and  will  be  satisfied  under  a  variety  of  boundary  con¬ 
ditions  commonly  encountered  in  practice. 

3.  Orthogonality  of  the,  principal  modes.  If  the  eigenvalue  problem 
(l)-f2)  is  self-adjoint,  then  for  any  two  distinct  eigenvalues  X.  and  X^,  the 
corresponding  eigenfunctions  y.  and  y  will  satisfy  the  following  orthogonality 
relations: 

'My'i  >  Yj)  =  "  ;  ♦  (Tjt  Y^)  =  0  (8a,  b) 


The  proof  follows  by  first  writing  M(y.)  =  X^  N(y.),  M(y.)  =  X ^  N(y_). 
Multiply  the  first  equation  by  y.  and  integrate  over  [a,  b]  ;  multiply  the 
second  equation  by  y.  and  integrate  over  [a,  b].  Now  subtract  the  two  result¬ 
ing  equations.  Then  by  virtue  of  the  self-adjointness  the  left  side  vanishes, 
while  the  right  side  becomes  (X.  -  X..)  V (y.,  y.).  Thus,  Eq.  (8a)  follows. 

Eq.  (fib)  then  also  follows  from  either  of  the  two  equations  which  were  ori¬ 
ginally  subtracted  from  each  other. 

If  to  a  single  eigenvalue  X .  there  correspond  r  linearly  independent 
eigenfuuctions,  y.j,  ....  y^,  X^  is  said  to  be  of  multiplicity  r.  In  this  case, 
any  linear  combine*  on  of  the  eigenfunctions  will  also  be  an  eigenfunction 

corresponding  to  and  it  is  then  readily  possible  to  thus  combine 

1  4 

y.j,  . . . .  yir  (Gram-Schmidt  procedure  ’  )  to  obtain  r  mutually  orthogonal 
eigenfunctions  corresponding  to  X^. 

From  the  orthogonality  relation  (8a)  it  can  be  readily  proved  that 
if  the  eigenvalue  problem  (l)-(2)  is  self-adjoint,  and  N  is  positiv  definite, 
then  all  the  eigenvalues  must  be  real.  For,  if  X^  were  a  complex  eigenvalue, 
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with  corresponding  eigenfunction  y^,  then  from  the  realness  of  the  coeffi¬ 
cients  of  Eqs.  (1)  and  (2),  the  complex  conjugate  \j  =  would  also  be  an 
eigenvalue,  with  the  complex  conjugate  eigenfunction  y..  Eq.  (8a)  would 
then  imply  (Yj*  Yj  )  =  0,  Putting  y^  u  +  :v,  y^  =  u  -  iv  this  relation  be¬ 
comes  (u)  +  4*  (v)  +  i  [4*  (v»  u)  -  4<  (u»  v)  ]  =  0.  Since  u  and  v  must  each  be 
comparison  functions,  the  imaginary  part  of  the  left  side  of  the  letter 
equation  vanishes,  while  by  virtue  of  Eq.  (5b)  the  real  part  must  be  positive. 
Thus,  assuming  the  existence  of  a  complex  eigenvalue  leads  to  a  contradiction. 

4.  Rayleigh  quotient.  For  a  function  u(x),  the  Rayleigh  quotient 
R(u)  in  connection  with  problem  (1)  is  defined  by 

R(u)*  (u,  Mu)/(u,  Nu)  *  4>  (u) /  tjj  (a)  (9) 

If  y.  is  an  eigenfunction  of  Eq.  (1),  with  eigenvalue  \^f  andi|j(y.)  /:  0, 
it  is  easily  seen,  by  multiplying  both  sides  of  Eq.  (1)  by  y^  and  integrating 
over  [a,  b],  that 

>4  =  R(Yi)  =  *(yi)/4»(yi)  (io) 

It  may  be  noted  from  Eq.  (10)  that  if  M  and  N  are  both  positive  definite,  then 
all  the  eigenvalues  \ /ill  be  positive. 

5.  Variational  calculus.  The  few  elements  of  the  calculus  of  varia¬ 
tions  which  will  be  used  subsequently  here,  will  now  be  introduced, 

A  functional,  I(y),  is  a  quantity  whose  value  is  determined  by  a  func¬ 
tion,  y(x).  An  important  example  of  a  functional  is 

I(y)  =  J  F(x,y,  y' ,...,y^)dx  (11) 

a 

where  a  and  b  are  given  constants,  and  F  is  a  given  function.  Clearly  I 
becomes  determined  once  the  (k  times  differentiable)  function  y(x)  in  [a,  b] 
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is  specified.  The  quantities  $  (u),  (u)  and  R(u)  defined  by  Eqs.  (4)  and 

(9)  are  important  specific  examples  of  functionals.  We  shall  be  concerned 
with  "  admissible"  functions  y(x)  for  which  I(y)  is  "  stationary"  .  Quite 
generally,  an  "admissible"  function  is  any  function  obeying  certain  condi¬ 
tions  (usually  certain  types  of  continuity  and  differentiability  properties, 
in  addition  to  certain  boundary  conditions)  specified  in  advance.  To  define 
what  is  meant  by  a  stationary  I  for  some  admissible  function  y(x),  consider 
a  second  function  y(x)  +  8  y,  where 

5  y  (x)  =  er)(x)  ,  (12) 


e  is  a  small  parameter,  and  rj(x)  is  an  arbitrary  function,  independent  of  e , 
such  that  y  +  5y  remains  admissible.  6y  is  called  a  variation  in  y.  Due  to 
such  a  variation,  F  will  be  changed  by 

AF  =  F(x,  y  +  e  n(x),  ....  y^  +  €  n^(x)  )  -  F(x,  y,  . . . ,  y^k*) 


assuming  that  for  all  x  in  [a,  b]  F  is  expandable  in  a  power  series  in  e  with 
a  non-zero  radius  of  convergence.  The  (first)  variation,  6F,  in  F  is  now 
defined  by  the  terms  in  the  first  power  of  e  in  AF,  i,  e. , 


e  [  Av  n  +  n  +  *  •  *  +  ru-i  n  ' 


(13b) 


Similarly,  the  functional  I  will  be  changed  by  A  I,  and  this  change  to  first 


powers  of  €  is  defined  as  the  (first)  variation,  61,  in  I.  Thus,  if  the  limits 

a  and  b  in  (11)  remain  fixed, 

b 

51  =  /  (5F)dx 
a 


(14) 


-11- 


The  functional  I  is  called  stationary  at  the  function  y(x)  when  61  =  0  for 
arbitrary  (admissible)  variations  of  the  form  (12),  and  hence  for  arbitrary 
(but  admissible)  rj(x).  The  function  y(x)  is  then  called  an  n  extremal"  of  I. 

There  is  an  analogy  between  finding  the  minimum  (or  maximum)  of 
a  functional  I  for  admissible  functions  y(x),  and  finding  a  relative  minimum 
(maximum)  of  a  differentiable  function  f(x)  at  an  interior  point  of  a  region 
(x.,x.)  .  In  the  latter  case,  if  a  relative  minimum  occurs  at  a  point  x  =  x  , 

i  w  0 

one  considers  the  values  f(xQ  +  e  )  in  the  neighborhood  of  xq.  Expanding, 
f(xo  +  e  )  =  f(xo)  +  e  f'  (xo)  +  (€  2/  2! )  f"  (x,  +  £  O,  0  <  $  <  1.  If  f(xQ)  is 
to  be  a  minimum  in  the  neighborhood  of  xq,  then  f(xQ  +  e  )  >  f(xQ)  for  all 

i 

sufficiently  small  |e  }  /  0.  If  f(x^)  ^  0,  then  it  is  clear  that  (with  f "  finite) 
for  sufficiently  small  |e  |,  the  minimum  condition  will  be  violated  for  at 
least  some  range  of  values  of  e  .  Hence  a  necessary  condition  for  a  relative 
minimum  at  x  is  f*  (x  )  =  0.  It  is  noted  that  this  manner  of  obtaining  this 
xamiliar  necessary  condition  is  equivalent  to  expressing  Af  5  f(xQ  +  e  )  -  f(xQ) 
to  first  powers  of  e  ,  and  equating  the  result  to  zero,  with  e  f  Of  that  is,  it  is 
equivalent  to  requiring  6f  =  0,  Analogous  considerations  hold  for  finding  the 
absolute  minimum  of  a  functional.  Suppose  y(x)  is  an  admissible  function 
for  which  I  attains  a  minimum.  Then  I(y  +  en(x))>  I(y)  for  all  e  and  any 
arbitrary  admissible  function  r](x).  If  one  expands  AI  sl(y  +  e  n(x)  )  -  I(y) 
in  powers  of  f ,  it  is  seen  that  a  necessary  condition  that  AI  >  0  for  (at 
least)  sufficiently  small  |e  |  ^  0  is  that  the  coefficient  of  e  vanish;  this, 
in  turn,  is  the  condition 

I 

61  =  0  (15) 

Thus,  a  necest  ry  condition  that  I  be  a  minimum  for  some  admissible  func¬ 
tion  y(x)  is  that  its  variation,  61,  vanish  for  (first  order)  small  arbitrary 
admissible  variations  6yJ  that  is,  I  must  be  stationary  at  y(x). 
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In  actual  evaluations  of  SI,  it  is  useful  to  note  that  in  general, 
d/  dx  (5y)  =  d/  dx  (en(x))  =  ctj1  (x).  But  also,  6(dy/  dx)  =  [y(x)  +  erj(x)3'  - 
y’  (x)  =  er)'  (x).  Hence 

d /  dx  (5y)  =  6(dy/  dx),  or  d(6y)  =  6(dy)  (16) 

To  familiarize  the  reader  with  the  variational  process  and  the  6 
notation,  suppose  it  is  required  to  find  a  differentiable  function  y(x)  (  j£  0) 
such  that 

y(0)  =  y(l)  =  0  (17a) 

and  such  that 

1  >  1  ? 

IS  L  •  \  I  =  f  (x  +  1)  y*  *  dx  -  X  f  y  dx  (17b) 

1  2  J0  J0 

rl 

be  stationary,  where  X  is  a  constant.  First,  it  is  noted  that  61.  =  I  (x  +  1) 

1  ‘0 

6(y'  2)  dx,  since  (x  +  1)  is  a  fixed  function  in  the  variation.  Moreover, 

2 

6(y'  )  =  2y’  6y' ,  since,  by  definition,  only  "  infinitesimally  small"  varia- 

2 

tions  are  considered,  i,  e. ,  the  change  here  in  y'  is  to  be  found  only  to 

first  powers  of  €  .  Thus,  one  finds  8L  =  2  (x  +  1)  y*  6y*  dx.  Now  inte- 

1  0 

grate  by  parts,  with  u  =  (x  +  1)  y* ,  dv  =  6y'  dx  =  6(dy)  =  d(6y)  ;  hence 

1  r1 

v  =  6y.  Then  one  finds  6  I,  =  2(x  +  1)  y’  6y  ]  -  2  f  [  (?.  +  1)  y'  ]  6y  dx. 

1  0  Jo 

Since  y  and  (y  +  8y)  must  satisfy  conditions  (17a),  it  follows  that  6y  =  0  at 

1 

x  =  0  and  at  x  =  1.  Consequently,  6L  =  -  2  f  [  (x  +  1)  y'  ]  6y  dx.  In  addi- 
1  -  1  1  0 

tion,  61,  -  f  8  (y^)  dx  =  2  f  y  6y  dx.  Thus,  one  finally  obtains 
&  J0  0 

1  »  ) 

61  =  2  J  [  (x  +  1)  y'  )  -  Xy  ]  6  y  dx  I  (17c) 

In  order  that  51  ~  0  for  arbitrary  (admissible)  8y(x),  it  follows  from  (17c) 
that  it  is  necessary  and  sufficient  that 
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(  (x  +  1)  y*  )  -  \y  -  0 


(17d) 


Thus,  y  must  be  a  solution  of  the  (here  of  Sturm- Li ouville  type)  differential 
equation  (17d)  and  satisfy  the  boundary  conditions  (17a).  It  is  thus  clear 
that  the  variational  problem  (17c),  (17a)  is  equivalent  to  the  differential- 
equation  problem  (17d),  (17a);  a  solution  of  either  problem  is  a  solution  of 
the  ot1  er.  In  this  example,  X  will  actually  be  an  eigenvalue,  since  it  would 
be  required  to  find  the  value(s)  of  X  (if  any)  for  which  non-trivial  solutions 
of  (17d)  ,  (17a)  exist. 

The  following  type  of  variational  problem  with  constraints  will  also 
be  of  importance  here.  Let  I(y)  be  defined  by  Eq.  (11),  where  the  limits  a,  b 
are  fixed.  The  functions  y  are  now  constrained  not  only  to  be  admissible 
(as  before)  but  also  to  satisfy  s  (*  isoperimetric*  )  constraint  conditions  of 
the  form: 


G .  =  f  g-  (x,  y, y' . y*kbdx  =  c.  (j  =  l, . . . ,  s) 

J  Ja  J  J 


(18a) 


where  the  are  given  functions,  and  the  c^  given  constants.  Among  all 
such  functions  y,  find  that  for  which  I(y)  is  a  minimum.  From  (18a)  the 


variations  in  y  must  satisfy  the  conditions 


6  G.  *  f  (6  g.)  dx  =  0  (j  =  1,  . . . ,  s) 

J  a  J 


(18b) 


As  in  the  problem  without  the  constraints  (18a),  a  necessary  con¬ 
dition  on  y(x)  is  still  that  it  make  I  stationary,  but  now  under  variations 
which  satisfy  (18b),  This  problem  is  analogous  to  finding  the  minimum  of 
a  (differentiable)  function  f(x^,  . . . ,  x^),  where  the  x^  are  subject  to  constraints 
of  the  form  In  (x^,  . .  . ,  x^)  =  c^,  i  =  1,  . . . ,  s  ;  (s  <  p).  In  particular,  the 


i„e.  k-times  differentiable  and  satisfying  the  boundary  conditions. 
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problem  (15),  (18b)  can  be  solved  by  introducing  Lagrange  multipliers 

K_, . . k  ,  which  for  the  isoperimetric  constraints  will  be  constants  (as 
i  8 

yet  unknown),  and  requiring 

6  (I  +  k1  Gj  +  . . .  +  k  s  G  g)  =  0  (18c) 

without  constraint  conditions  (except  for  admissibility)  on  y.  Suppose  the 
general  solution  of  (18c)  for  admissible  y  is  obtained.  This  solution  will 
depend  on  the  constants  K  y  . . . ,  K The  latter  are  then  determined  so  that 
the  s  constraint  conditions  (18a)  hold.  If  such  constants  exist,  the  resulting 
y  is  then  the  desired  solution.  For,  first  of  all,  it  satisfies  (18a).  More¬ 
over  (18c)  is  satisfied  for  arbitrary  (admissible)  variations  8y,  and  hence 
it  will  also  be  satisfied  for  those  particular  6y  which  satisfy  (18b).  However, 
for  such  variations,  (18c)  implies  that  (15)  will  hold.  Thus,  y  is  a  solution 
of  the  problem  (15)  under  variations  satisfying  (18b).  Conversely,  any  solu¬ 
tion  y,  of  the  problem  (15),  (18b)  must  also  be  a  solution  of  (18c)  with  the  k  ^ 
as  determined  above.  For,  such  a  y  will  satisfy  (18c)  under  the  constraint 
conditions  (18b),  for  any  values  of  the  k..  Therefore,  such  a  solution  must  be 
a  subset  of  solutions  of  (18c)  without  any  constraints  on  y.  Thus,  this  solu¬ 
tion  must  be  included  in  the  general  solution  of  (18c)  for  any  K  and  must 
hence  be  the  solution  of  (18c)  for  those  for  which  (18a)  is  satisfied. 

6.  Stationary  properties  of  the  eigenvalues.  It  will  now  be  shown 
that  if  the  problem  (l)-(2)  is  self-adjoint,  then  any  eigenfunction  y.,  with 
corresponding  eigenvalue  \  =  V,  is  such  that  the  quantity  ($  (y)  -  X.  (y)  ) 
will  be  stationary  at  y  =  y^  with  respect  to  comparison  functions  y. 

The  proof  can  be  carried  out  rather  easily  by  first  noting  that  since 
M  and  N  are  linear  operators 

8  M(y)  =  M(6y)  ,  8  N(y)  =  N  (6y)  (19) 
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From  Eqs.  (19)  and  (7)  it  follow*  that 

b  „b  b 
J  yS  M(y)  dx  =  j  y  M(5y)  dx  =  J  M(y)  6y  dx. 

a  a  a 

Consequently,  from  (4), 

b  b  b 

5  $  (y)  =  J  y6  M(y)  dx  +  J  M(y)  5y  dx  =  2  f  M(y)  5y  dx  (20a) 
a  a  ‘a 

Similarly, 

b 

5  tMy)  =  2  J  N(y)  gy  dx  (20b) 

J  a 

Equations  (19)  and  (20)  hold  for  any  comparison  function  y  and  (first-order 
small  comparison)  variations  6y.  Suppose  now  that  y^is  an  eigenfunction  of 
(1),  (2)  with  corresponding  eigenvalue  Multiply  Eq.  (1 ),  with  y  =  y^ 

X  =  \^,  by  6^  and  integrate  with  respect  to  x  over  [a,  b],  Then  by  virtue 
of  Eqs.  (20a,  b)  it  is  immediately  seen  that  the  variational  equation 
(1/  2)  (6  $  (y .)  -  X.  8  4»  (v^)  =  0  is  obtained.  With  X  =  X^  kept  fixed  in  these  varia¬ 
tions,  the  variational  condition  can  be  written  as; 

8$-X64i=6($-X4')  =  0  (21) 

This  proves  the  theorem. 

The  ccnverse  of  the  above  theorem  also  holds.  That  is,  if  a  com¬ 
parison  function  y  and  X  are  such  that  the  variational  condition  (21)  holds 
for  arbitrary  comparison  variations  6y,  then  y  and  X  are  an  eigenfunction 
and  corresponding  eigenvalue  of  the  problem  (1),  (2)  when  the  latter  is  self- 
adjoint.  This  follows  by  noting  that  Eq.  (21)  in  conjunction  with  Eqs,  (20a,  b) 
will  imply 

b 

J  (M(y)  -  X  N(y))8y  dx  =  0  (22) 

J  a 

Since  (except  for  being  a  comparison  function)  6  y(x)  is  arbitrary,  this 
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implies  Eq.  (1),  and  the  theorem  is  proven. 

It  is  now  noted  that  when  4<  (u)  4  0,  Eq.  (21)  is  equivalent  to  the 
condition 

5R  =  0  (23) 

where  R  is  defined  by  Eq.  (9)  and  the  variations  are  with  respect  to  com- 

2 

paris on  functions.  This  follows  by  simply  noting  that  6R  =  (l/^  )• 

[4*  (u)  5*  -  $  (u)  8  i|i]  =  (1/44  [8$  -  R  64*3-  The  condition  (21),  with  \3  R, 
is  hence  equivalent  to  condition  (23). 

Thus,  if  the  problem  (l)-(2)  is  self-adjoint,  the  eigenfunctions  are 
the  comparison  functions  for  which  the  Rayleigh  quotient  R  assumes  station¬ 
ary  values,  and  the  corresponding  eigenvalues  X.  will  be  the  corresponding 
stationary  values  of  R. 

The  above  stationary  property  of  the  eigenvalues  is  already  sufficient 
to  indicate  certain  computational  procedures  by  energy  methods,  such  as 
Galerkin' s  method,  to  obtain  at  least  approximately,  the  eigenvalues  and 
eigenfunctions  of  (l)-(2).  This  will  be  discussed  in  a  subsequent  report. 

It  should,  however,  be  noted  that  these  stationary  properties  do  not  as  yet 
establish  the  minimum  characterizations  of  the  eigenvalues.  The  latter 
properties  will  now  be  established. 

7.  Existence  of  a  greatest  lower  bound  of  R.  It  will  now  be  assumed 
that  in  addition  to  being  self-adjoint,  the  operators  M  and  N  are  respectively 
positive  semidefinite  and  positive  definite,  i.  e. 

$  (u)  >  0  ,  ip  (u)  >  0  (24a) 

for  any  comparison  function  u  not  identically  zero.  It  then  follows  that 


R  (u)  >  0 


(24b) 


Thus,  the  Rayleigh  quotient  R(u)  has  a  lower  bound  in  this  case,  and  hence 
it  must  have  a  greatest  lower  bound,  B  (say).  Clearly,  B  >  0. 

8.  Minimum  characterization  of  the  lowest  eigenvalue.  The  follow¬ 
ing  theorem  can  now  be  readily  proven.  Let  the  eigenvalue  problem  (l)-(2) 
be  self-adjoint.  Moreover,  let  the  Rayleigh  quotient  R(u)  for  comparison 
functions  u  have  a  greatest  lower  bound  B,  as  is  the  case, for  example, 
when  the  inequalities  (24a)  hold.  Finally,  let  there  exist  a  comparison 
function  u  =  yj  for  which  R(y^)  =  B  Then  the  lowest  eigenvalue  of  (l)-(2)  will 
be  X  =  Xj  =  B,  with  y  =  y1  a  corresponding  eigenfunction. 

The  proof  follows  readily  by  observing  that  under  the  conditions  of 
the  theorem,  R(u)  attains  a  minimum  when  u  =  y^ ,  and  hence  the  first  varia¬ 
tion,  5R,  in  R  vanishes  for  arbitrary  comparison  variations,  6u,  in  y^ 

(Sect.  5).  It  then  follows  from  the  stationary  properties  of  the  eigenvalues 
established  in  Sect.  6,  that  u  =  yj  is  an  eigenfunction.  FromEq.  (10),  the 
corresponding  eigenvalue  is  X^  =  R(y^).  Since  R(y^)  =  B,  X^  =  B.  The  fact 
that  B  is  the  lowest  eigenvalue  follows  from  the  fact  that  if  X^  is  any  eigen¬ 
value,  with  eigenfunction  y^,  then  from  Eq.  (10),  X.  =  R(y^).  But 
R  (yj )  >  B .  .  ’ .  X  i  >  B . 

If  yi  is  any  eigenfunction  of  (l)-(2)  then  C.  yi  is  also  an  eigenfunction, 
where  Ch  is  an  arbitrary  constant.  When  i|;  (y.)  ^  0  it  is  therefore  possible 

to  "normalize"  the  eigenfunctions  by  specifying,  for  example,  that  vp  (y. )  =  1. 
Under  the  conditions  of  the  theorem  just  proven  the  lowest  eigenvalue  Xj 
can  then  be  characterized  as  the  minimum  value  of  $(u)  with  respect  to  all 
comparison  functions  u  for  which 


4j(u)  =  1 


(25) 
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3.  Recursive  minimum  characterization  of  the  higher  eigenvalues. 

The  higher  eigenvalues  of  (l)-(2)  can  be  characterized  by  'he  following  theorem 
Let  the  conditions  of  the  precedes  theorem  hold.  In  addition,  suppose  there 
exists  ?  comparison  function  u  =  satisfying  (25).  orthogonal  to  the  first 
(k  -  1)  eigenfunctions  y, . j,  in  the  sense  of 

Hu.  y.)  =  0  (i  =  -  1)  (26) 


and  minimizing  the  integral  9  (u)  with  respect  to  comparison  functions  u  subject 
to  the  constraints  (25)  and  (26).  Then  is  an  eigenfunction,  with  the  cor¬ 
responding  eigenvalue  =  4(1^).  V  is  the  next  higher  eigenvalue. 

This  theorem  can  be  proven  using  the  familiar  technique  of  Lagrange 
multipliers  (Sect.  5).  Thus,  introducing  Lagrange  multipliers  k^,  Kj,  ....  ^ 

the  conditions  of  the  theorem  imply  6  [  4(u)  -  Kk  >l»(u)  -  i  (u,  y^l  -  • 

-  *k  ,  i  (u,  y^  j)  1  ~  0  when  u  =  u^.  Using  Eqs.  (20a,  b)  and  (4)  this  is  found 
to  imply :  2  (Su,  (M(u)  -  *  N(u)  j  )  -  di  (Su,  yj  -  ...  -  k^j  Cj  (6u,  y^  j)  =  0. 


when  u  sr  v*^  I  hence,  rince  Su  is  arbitrary 


2  [  M  (u^)  -  ^ x  N'(yj)  +-  -  •+  "k_, 


(27) 


Multiply  both  sides  of  Eq.  (27)  by  y..  (i  =  1,  . . . ,  k-1)  and  integrate  over  [a,  b]. 

Then  by  virtue  of  Eqa.  (8a)and  (7)  it  is  found  that  k  i  (y.)=  2®  (u^,  y.)  - 

2  K  4»  (u,  y.).  But  since  M(y. }  =  X.  *f(y. ),  the  preceding  equation  implies 
K  K|  1  1  1  4 

Ki  4»  (y J  =  2(\  ^  -  k^Hu^,  yj)«  From  Eq.  (26),  and  <j  (y. )  >  0,  it  then 

follows  that  k.  =  0  (i  =  1, _ k-1).  Hence  Eq.  (27)  yields  M(u)  =  Kk  Nfc^). 

This  proves  that  is  an  eigenfunction,  with  corresponding  eigenvalue 
X  =  Applying  this  result  recursively  to\^,\y...  it  is  seen  that 

each  Xk+1  >  X  k(k  =  1,  2,  . . .  ).  For,  Xk  =  min  9  (u)  with  u  subject  > 

(25)  and  to  fewer  of  the  orthogonality  conditions  in  (26)  than  Xk+j. 


Thus 
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the  set  of  permissible  functions  for  is  only  a  subset  of  those  for  ; 

hence  X-  . 

k+1  ~  k 

It  remains  to  prove  that  with  the  X^  thus  characterized  recursively, 
there  can  be  no  eigenvalue  between  X^  and  j.  This  will  then  show  that 
the  characterization  of  the  eigenvalues  given  in  the  theorem  of  this  section 
exhausts  all  the  eigenvalues.  It  will  further  show  that  the  eigenvalues  form 
a  discrete  spectrum.  Consider, then,  an  eigenvalue  X'  greater  than  ji 
with  eigenfunction  u*  satisfying  (25).  Then  u*  must  he  different  from 


y?»  Yy  -  •  •  y. 


Hence  by  Eq.  (8a),  u’is  orthogonal  to  y, ....  y  .  Thus 


u'  satisfies  the  same  constraint  conditions  as  u^.  But  by  Eq.  (10), 

X'  =  9  (u*  ).  Hence  from  the  present  characterization  of  \^,  X^  <  X'  . 

Thus,  there  is  no  eigenvalue  between  X^  ^  and  X^. 

The  recursive  minimization  theorem  implies  that  (when  it  holds) 
the  set  (spectrum)  of  eigenvalues  will  be  infinite  in  number  (as  well  as  dis¬ 
crete).  Finally,  it  is  noted  that  the  recursive  minimization  theorem  can 
also  be  formulated  without  the  normalization  requirement  (25)  by  replacing 
$(u)  in  the  theorem  by  R(u). 

10.  Maxim im- minimum  theorem  and  some  implications.  The  pre¬ 
ceding  recursive  minimum  property  of  X^  depends  on  the  lower  eigenfunctions 
y  ,  . .  . ,  y  .  It  is  possible,  however,  to  characterize  the  k*  th  eigenvalue 
independently  of  the  lower  eigenfunctions,  by  means  of  a  maximim-minimum 
theorem,  first  emphasized  by  Courant.  This  theorem  can  be  stated  as 
follows.  Consider  all  comparison  functions  u  which  are  orthogonal  to 


each  of  (k-1)  given  functions  crj 


k  ^that  is  (u,  N  (cp  j))  =  0  (i  =  1,  .  .  . ,  k-1). 


Consider  now  the  minimum,  R  .  ,  with  respect  to  u,  of  the  Rayleigh  quotient 

mm 

(9).  Then  the  k' th  eigenvalue  X^  is  the  maximum  value  of  Rin-n  which  can 
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be  obtained  by  varying  the  function*  <p  ^,  .  .  .  • 

This  theorem  can  be  proven  by  first  choosing  a  function  in  the  form 

u  =  ^  a.  v.  .  where  y.  is  the  i' th  normalized  eigenfunction.  A  set  of  a., 

°  i=l  1  1  1 

not  all  zero,  can  be  found  such  that  (u  ,  N  <p  .)  =  0,  j  =  1,  .  . . ,  k-1.  For, 

o  J 

these  conditions  will  lead  to  (k-1)  linear  homogeneous  algebraic  equations 
in  k  unknowns  (a^,  ....  a^).  Such  a  set  of  equations  will  always  have  a  non¬ 
trivial  solution  (which  need  not  be  unique).  From  the  linearity  of  M  and  N, 

k  k 

and  Eqs.  (Sa,  b),  one  then  finds  R(u  )  =  £  a^  $  (y.)/  Y  af  .  But  from 

0  i=i  k  i , 

Eqs.  (10)  and  (25),  $(y.)  =  X.,  Hence  R(u  )  =  V  a,  X  /  Y  a  .  This  is  a 

i  i  °  itl  1  1  i=l  1 

weighted  average,  with  the  non-negative  weights  a^,  of  X^,  .  .  .,  X^.  Hence 

R(Uq)  <  X^,  and  thus  Rmin  <  X^.  If  the  functions  <p  . ,  cp  are  now 

permitted  to  vary,  it  is  known  from  the  recursive  minimum  property  of 

X,  that  R  .  =  X,  when  cn  .  =  y..  Thus,  the  maximum  value  of  R  .  obtained 

k  min  k  ^  l  l  min 

by  varying  cd  p  •  •  • » ®  is  ^ 

It  may  be  noted  that  the  max-min  theorem  can  be  stated  with  the 
orthogonality  conditions  on  the  comparison  function  u  in  the  form  (u, cp  .)  =  0 
(i  =  1,  .  .  . ,  k-1).  The  proof  would  proceed  exactly  as  above,  except  that  in 
the  final  step  it  is  noted  that  now  =  Xk  when  cp  i  =  N(y,). 

The  following  implication  of  the  max-min  theorem  should  be  noted. 
Consider  two  linear  self-adjoint  systems  S  and  S’  characterized  by  Eqs. 
(l)-(3),  and  assume  for  both  sufficient  conditions  for  which  the  preceding 
minimum  and  max-min  theorems  will  hold.  Let  the  boundary  conditions 
be  the  same  for  S  and  S' ,  but  let  the  coefficients  in  S'  be  such  that 
R(u)  <  R'  (u)  for  all  comparison  functions  u.  Then  X^  <  X^  *n  each  mode 
(k),  (In  actual  applications  of  this  principle  the  Dirichlet  forms  of  (y) 
and  ij;  (y),  defined  in  Eqs.  (28)  below,  are  often  useful).  This  is  proven  by 


first  noting  that  among  comparison  functions  of  a  given  set, 


K 


.  <  R'  .  . 

min  —  min 


For,  let  u  be  the  function  of  the  set  for  which  R  is  a  minimum,  and  let  u' 

be  the  function  of  this  set  for  which  R'  is  a  miminum;  then  R  =  R(u)  < 

mxn  — 

R{u*  )  <  R*  (u*  )  =  RJ^h*  This  immediately  shows  that  X^  <  X'^.  Moreover, 

for  the  higher  modes,  let  v  by  any  comparison  function  orthogonal  (with 

the  operator  N)  to  the  first  (k-1)  eigenfunctions  of  S.  Th^n  X,  =  min  R(v) 

k  v 

<  min  R'  (v).  But  by  the  max-min  theorem  for  S'  ,  min  R*  (v)  <  XJ  . 

—  v  v  —  k 

Hence  X^  <  X^  .  One  application  of  this  theorem  is  furnished  by  the 
enclosure  theorem  of  Collatz  [Ref.  1,  pp.  131-137], which  is  a  generalization 
of  that  first  given  by  Temple. 

A  second  important  general  implication  of  the  max-min  theorem  is 
the  following.  If  two  self-adjoint  systems  S  and  S'  satisty,  as  above,  condi¬ 
tions  for  the  existence  of  the  minima  of  R,  and  are  the  same  except  that  S' 
is  more  constrained  than  S,  meaning  that  the  class  of  comparison  funticns 
for  S*  is  only  a  subset  of  that  for  S,  then  the  eigenvalues  of  S'  will  be  at 
least  as  large  as  the  respective  eigenvalues  of  S.  Proof:  Let  u  be  any  com¬ 
parison  function  for  S,  and  u'  any  comparison  function  for  S’  ,  each  ortho¬ 
gonal  to  the  first  (k-1)  eigenfunctions  of  S.  Then,  since  the  class  of  u'  is  a 

subset  of  the  class  of  u,  min  R(u)  <  min  R  (u1  ) .  But  X,  =  min  R(u).  More 

u  —  ui  k  u 

over,  by  the  max-min  theorem  for  rf' ,  Xj|  >  min  R(u'  ).  Hence  >  X^. 
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III.  THE  ENERGY  QUOTIENT 


11.  Dirichlef  forms.  The  various  extremum  theorems  of  Ch.  II 
concern  the  Rayleigh  quotient  R(u),  defined  by  Eq.  (9),  where  u  is  a  com¬ 
parison  function.  There  is,  however,  a  related  quotient,  to  be  called  the 
"  energy  quotient"  ,  which  also  plays  an  important  role  both  in  the  theory  and 
in  applications.  Before  introducing  this  quotient,  it  is  observed  that  with 


M(y)  and  N(yl  given  as  in  Eqs.  (3),  successive  integration  by  parts  will 
yield  the  following  forms  for  $(y)  and  4*  (y ): 


b  111  2 

*(y)  s  f  [  i  *  (y(v))  1 dx  +  m  (y)  \ 

a  v  =  0  v  °  I 

I  (28) 

b  n  ,  ,  2  ( 

+  (y)  =  /  [  Tj  gv  (y  v;)  1  dx  +  N  (y)  I 

a  v  =  0  f 


where  MQ  and  N  are  quadratic  forms  in  the  end-point  values  of  y  and  its 
derivatives  up  to  the  (2m-l)th  and  (2n-l)th  orders,  respectively.  Eqs.  (28) 
are  called  Dirichlet  forms  of  $(y)  and  4Ky)»  and  hold  for  all  (suitably  differ¬ 
entiable)  y. 

If  the  boundary  conditions  (2)  make  the  system  self-adjoint,  and  if 
y  is  a  comparison  function,  then  it  will  be  found  that  the  boundary  terms 
Mq  and  Nq  can  in  general  be  written  respectively  as  quadratic  form^in 
the  end-point  values  of  y  and  its  derivatives  up  to  only  the  (m-l)th  and 
(n-l)th  orders.  A  detailed  proof  of  this  is  given  in  Ref.  13.  A  comparatively 
simple,  indirect  type  of  proof  can  also  be  given,  using  the  variational  prop¬ 
erties  established  in  section  6.  Thus,  from  Eq.  (20a),  for  comparison 
functions  y, 

b 

j  M(y)  5y  dx  =  6(*(y)/2) 
a 


(29) 


-23- 


However,  with  M(y)  given  by  Eq.  (3),  the  left  side  of  Eq.  (29)  can  also  be 
evaluated  by  successive  integration  by  parts  (noting,  e.g.  ,  Eq.  (16)).  This 
will  yield  the  form: 

b  b  m  2 

f  M(y)6ydx  =  6  [  (1/2)  f  (  V  f  (y(v))  )  dx  ] 
a  a  v=  0 


m-i  h 

+  [  L  A  (y)  6y(p)  ]  (30) 

p  =  0  ^  a 


where 


III 

a„  (y)  =  L  (-D  p+i  (f.  ri))(U1'p) 

f  i=p+i  1 


Similar  equations  hold  with  M(y)  replaced  by  N(y),  and  m,  ft  and  A^  replaced 
by  n,  g^  and  B  .  Eqs.  (30)  and  (31)  hold  for  any  suitably  differentiable  func¬ 
tions  y.  From  Eq.  (29)  it  follows,  however,  that  if  y  is  a  comparison  func¬ 
tion,  then  the  right  side  of  Eq.  (30)  must  be  expressible  as  the  exact  vaca¬ 
tion  of  some  functional;  in  fact,  comparing  Eqs.  (29)  and  (30),  and  noting 
Eq.  (28),  it  is  seen  that  for  comparison  functions  y, 
m-1  b 

2[  £  A  (y)  6  y(p)]  =  6  M  (32) 

p  =  0  p  a  ° 

Since,  however,  5y^  appears  only  for  p  <  m  -  1  in  the  left  side  of  Eq. 

(32),  it  follows  that  in  any  self-adjoint  case  Mq  (and  hence  also  the  A  ) 
must  be  expressible  in  terms  of  only  the  y^  ,  p  <  m-1,  when  account  is 
taken  of  all  the  specific  boundary  conditions. 


12.  Natural,  geometric  and  dynamical  boundary  conditions.  For  a 
given  variational  problem,  an  "  admissible"  function  may  be  defined,  quite 
generally,  as  any  function  among  the  permitted  class  of  competing  functions 
for  that  problem.  An  admissible  function  for  the  Rayleigh  quotient  in  con¬ 
junction  with  the  self-adjoint  system  (l)-(3),  for  example,  would  be  any 
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comparison  function  (as  previously  defined).  A  "  natural"  boundary  condi¬ 
tion  for  a  given  variational  problem  may  be  defined,  in  general,  as  one 
■which  would  always  be  automatically  satisfied  by  a  solution  of  the  problem, 
although  the  admissible  functions  are  not  constrained  to  satisfy  this  con¬ 
dition  in  advance.  A  variational  problem  equivalent  to  a  given  differential 
system  (such  as  (l)-(3))may  often  be  modified  to  enlarge  the  class  of  admis¬ 
sible  functions  by  modifying  the  functional  to  be  stationary.  This  will  be 
illustrated  here,  and  more  generally  in  the  next  section,  by  an  important 
and  well-known  principle  for  the  system  (l)-(3). 

In  connection  with  the  system  (l)-(3)f  it  will  be  convenient  to  define 

"geometric"  and  "  dynamical"  boundary  conditions,  a  terminology  due  to 

14 

Biezeno  and  Grammel  .  Let  the  maximum  number  of  boundary  conditions 
in  Eqs.  (2)  that  can  be  formed  to  contain  only  y  and  its  derivatives  of  order 
(m-1)  or  lower  be  k.  Then  these  k  conditions  will  be  called  the  "  geometric" 
boundary  conditions.  The  remaining  (2m-k)  conditions  will  include  deriva¬ 
tives  of  order  m  or  I4'  3r,  and  will  be  called  the  "dynamical"  boundary 
conditions.  Geometrical  and  dynamical  boundary  conditions  are  also  some¬ 
times  called  respectively  "essential"  and  "  suppressible"  ,  or  "remaining", 

13 

after  Kamke  .  It  will  be  shown  in  Section  13  that  by  introducing  a  modified 
quotient  (the"energy  quotient"  )inplaceof  the  Rayleigh  quotient,  a  variational 
problem  equivalent  to  (l)-(3)  can  be  formulated  in  which  the  admissible 
functions  need  only  satisfy  the  geometric  boundary  conditions.  The  dynami¬ 
cal  boundary  conditions  will  in  this  case  be  automatically  satisfied  by  any 
solution  of  the  problem,  and  will  therefore  be  the  natural  boundary  condi¬ 
tions  of  the  problem.  This  has  been  shown  for  second-order  (Sturm- Liouville) 

4  11 

systems  by  Courant  and  Hilbert  ,  Sagan  ,  and  many  others,  Biezeno  and 
14 

Grammel  have  shown  this  for  fourth-order  (beam)  systems,  and  an  extension 
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to  general  self-adjoint  systems  of  2m‘ th  order  has  been  given  by  Kamke^. 

A  simplified  treatment  for  this  general  2m' th  order  case  will  be  given  in 
Section  13. 

To  illustrate  the  statements  in  the  preceding  paragraphs,  as  well 
as  the  more  general  analysis  to  be  given  in  the  succeeding  Section,  consider 
a  beam  of  length  L  governed  by  the  equation 


(p(x)  y"  )  =  X  p(x)  y 


(33a) 


(p(x),  p(x)  >  0).  By  successively  integrating  by  parts,  it  is  found  that  for 
any  (suitably  differentiable)  function  u(x)  and  (small)  arbitrary  variations 
5u, 


f  [  (pu"  )  -  X  pu  ]  6u  dx  =  6  [  f  (pu"  c  /  2)  dx  -  X  f  { pu L /  2)  dx  ] 

0  J0 


+  (pu"  )  Su 


pu"  Su1 


L 

0 


(33b) 


Suppose  that  the  boundary  conditions  are: 

y(0)  =  y'  (0)  =  0;  (py»  )'  (L)  =  (3y(L)  ;  py"  (L)  =  -  ay'  (L) 

(fixed -elastic  ends)  and  consider  now  only  functions  u  which  satisfy  the  geo¬ 
metric  boundary  conditions  u(0)  =  u'  (0)  =  0.  Moreover,  in  view  of  the 

given  dynamical  boundary  conditions,  put  pu"  =  -  a  uT  ,  (pu"  )  =  p  u. 

u  •Lj  L 

in  the  boundary  terms  of  (33b).  Then  from  (33b),  Eq.  (33a)  is  thus  seen 
heuristically  to  lead  to  the  following  va-iational  problem: 


6  [  f  pu"  2  dx  +  pu“  +au'f  •  X  (  p  u2  dx  ]  =  0 
J  0  h  J0 


(33c) 


where  the  admissible  function  u  is  any  function  with  continuous  derivatives 
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✓ 


in  [0,  L]  through  order  4  which  satisfies  the  given  geometric  boundary 
conditions  u(0)  =  u'  (0)  =  0.  It  will  now  be  shown  that  (33c)  is  indeed  equiva¬ 
lent  to  the  original  problem. 

From  '33b)  it  follows  that  for  admissible  functions  u,  i.e.  satisfying 
here  u(0)  =  u'  (0)  =  0,  Eq.  (33c)  implies 


J  [  (pu"  )  -  \pu  ]5udx  +  [fh* **L-  (pu"  )'L  ]  6ul  +  [au'L  +  Px’l  1  6U*L  =  0 

(33d) 

If  5u(x)  is  arbitrary  in  (33d)(except  for  being  admissible)  and  6uT  ,  6u' 

Lj  jL 

are  arbitrary  ,  and  independent  of  each  other,  then  (33d)  implies  Eq.  (33a) 
together  with  the  dynamical  boundary  conditions  pu'  =  -  a  u'T  ,  (pu!l  )'  =  puT  . 
The  latter  are  thus  the  natural  boundary  conditions  in  the  variational  prob¬ 
lem  (33c).  Condition  (33c)  is  equivalent  to 


6  R  (u)  3  6 


h  p  y  y 

f  pu"  c  dx  +  p  u  +  a  u*  T 
J  0  EE 

L  ? 
f  p  u  dx 

J  A 


=  0 


(33e) 


R  (u)  is  the  ■  energy  quotient"  for  this  problem,  and  (33e)  shows  that  the 
original  differential  equation  problem  formulated  here  is  equivalent  to  making 
R  (u)  stationary  with  respect  to  admissible  functions  u  only  required  to  satisfy 
the  geometric  boundary  conditions. 

The  preceding  result  will  now  be  generalized  to  the  2m' th  order  self- 
adjoint  system  (l)-(3). 


*  Actually,  as  might  be  surmised  from  the  form  (33c),  u  need  only  have 
a  continuous  derivative  in  [a,  b]  through  order  m-1  =  1,  and  a  piecewise 
continuous  derivative  of  order  m  =  2.  This  will  be  proven  in  Chap.  IV. 

**  The  importance  of  this  arbitrariness  will  be  illustrated  in  an  actual 
application  in  a  subsequent  report. 
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13.  Energy  quotient.  Using  the  Dirichlet  forms  (28),  the  "energy 
quotient"  R  (u),  for  self-adjoint  systems  governed  by  (l)-(3),  will  be  defined 


R  (u)  =  $  (u)/  4j  (u) 


(34a) 


where 


b  m  2 

®  (u)  =  J  \  (u^V  ^  )  ]  dx  +  Mo  (u) 

a  v  =  0 

b  »  .  x  2 

4Mu)  =  J  [  L  (u  >  Jdx+N  (u) 

J  a  v  =0  v  ° 


(34b) 


(34c) 


and  Mq(u)  and  Nq(u)  are  the  expressions  for  Mq(u)  and  Nq(u)  after  the  values 

of  u  and  its  derivatives  at  the  end  points,  given  by  the  boundary  conditions, 

have  been  inserted,  and  all  the  derivatives  of  u  at  the  end  points  of  order 

m  and  higher  have  thus  been  eliminated  from  and  N  .  Note  that  if  u  is 

a  comparison  function,  then  $  (u)  =  4  (u),  (u)  =  (u)»  R  (u)  =  R(u).  For  the 

—  2  2  — 

example  in  Sect.  12,  it  will  be  found  that  M  =  |3  u  +  a  u'  ,  N  =  0. 

O  Lj  I-j  o 

It  will  also  be  useful  to  define,  analogously,  the  quantities  $  (u,  v) 
and  ij)  (u,  v)  for  admissible  functions  u  and  v.  For  this  purpose,  consider 
first  $  (u,  v),  as  defined  by  (4),  in  conjunction  with  Eq.  (3).  Integrating  by 
parts,  the  following  form  is  obtained: 


b  I  \  b 

$  (u,  v)  -  f  ^  f  u'v  v'v  dx  +  [M  (u,  v)  ] 

J  “  /\  V  O  EL 

a  v  =0 


(34d) 


where  M^u,  v)  is  a  bilinear  form  in  u^T\  v^8\  0  <  r  <  m  -  1,  1  <  s  <  2m-l. 
Suppose  now  that  in  the  boundary  term  [M  (u,  v)  the  values  of  u,  v  and 
their  derivatives  according  to  the  self-adjoint  boundary  conditions  are  in¬ 
serted,  thereby  eliminating  all  derivatives  above  order  (m-1).  The  result 
will  be  denoted  by  Mo(u,  v);  ${u,  v)  is  then  defined  by 
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b  m 

«(ufv)  =  f  (  ^  fv  u^v  ^  v^v  *)  dx  +  Mq(u,  v)  (34e) 

a  v  =  0 

with  a  similar  definition  of  4*  (u,  v),  with  f  replaced  by  g,  and  Mq  oy  Nq.  It 
should  be  noted  that  if  u  and  v  are  admissible  functions  (satisfying  only  the 
geometric  boundary  conditions),  and  the  boundary  conditions  are  self- 
adjoint,  then 

Mo(u,  v)  =  Mq(v,  u)  ;  Nq(u,  v)  •■=  Nq(v,  u)  I 

(  (34f) 

fc(u,  v)  =  $  (v,  u)  ;  4(u,  v)  =4l(v,  u)  1 

It  may  also  be  noted  that  from  the  definitions,  $  (u)s$(u,  u)  ;  4  (u)*4*  (u>  u)» 
Finally  it  is  observed  that  if  u  and  v  are  comparison  functions,  then  all 
barred  quantities  will  be  identical  with  the  corresponding  unbarred  quantities. 

The  following  variational  principle  will  now  be  established.  When 
the  problem  (l)-(3)  is  self-adjoint,  the  eigenfunctions  y  will  be  those  2m- 
times  differentiable  functionc  u  in  [a,  b]  satisfying  the  geometric  boundary 
conditions,  for  which  R(u)  assumes  stationary  values;  these  values  will  be 
the  corresponding  eigenvalues  \.  In  this  connection  the  quantity 
P  a  (1/  2)(  $  -  X  i|7  )  may  be  called  the  generalized  total  potential,  since 
the  preceding  principle  is  seen  to  be  a  generalization,  in  one  dimension,  of 
the  familiar  principle  of  the  stationary  total  potential  in  elasticity.  The 
principle  implies  that  the  uy-namical  boundary  conditions  will  be  automatically 
satisfied. 

To  establish  this  principle  it  is  first  noted  that  from  Eqs.  (30)  and 
(34b,  c),  it  follows  that  the  statement  6P  =  0  is  equivalent,  for  any  (suitably 
differentiable)  functions  y,  to 


*  See  first  footnote  of  Section  12. 
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D  ***  *  | 

2  J  [  M(y)  -  X  N(y)  ]  5y  dx  +  S  Mq  -  [  2  ^  A  (y)  6y*P*  ] 
a  p  =  0  p  i 

+  X  j[Z  £  B  (y)  5y(p)  ]b  -  6  Nq  j  =  9 
o=0  "  a  * 


Suppose  now  that  in  MQ(y),  the  m*  th  and  higher  derivatives  of  y  have  been 
eliminated  in  accordance  with  the  (self-adjoint)  boundary  conditions.  Then 
the  resulting  form  Mq  for  becomes: 


m-1  b 

M  (y)  =[  X  A  (y)y(p)] 

p  =  0  p  a 


(36a) 


where  the  A^  (y)  (some  or  all  of  which  may  be  zero)  are  certain  linear  func¬ 
tions,  depending  on  the  particular  boundary  conditions,  of  y  and/  or  its  deri¬ 
vatives  up  to  (m-l)th  order.  By  virtue  of  the  symmetrical  properties  of 
$  (u,  v)  and  Mq  (u,  v)  for  admissible  functions  (cf.  Eqs.  (34f)  )  it  follows  now 
that  an  equation  corresponding  to  (32)  will  hold  for  variations  in  with 
respect  to  admissible  functions,  i.  e. 


,  .  b 

6  M  (y)  =-  2  [  £  A  (y)  6y'P'  ] 

°  p=0  p  a 


(36b) 


A  similar  expression  holds  for  6  N  (y),  with  A  replaced  by  B  .  Thus, 

o  p  p 

Eq.  (35)  becomes: 


f  [M(y)  -  X  N(y))  6y  dx  -  [  £  {  A  (y)  -  A  (y)  J  6y(p)  ] 


p  =  0  p  p 


+  M  L  I  B0(y)  -  BD(yU  fiy(p)^  = 


(37a) 


In  the  example  of  Sect.  12  the  equation  corresponding  to  (37a)  is  Eq.  (33d). 
in  which  j  =  =  0  for  all  p,  (AQ)a  =  (A^)  (A^'  •  (A(  0, 


'x'!x)  is  arbitrary.  Eq.  (?7a)  is  equivalent  tc 


M(v»  -  \  N(y)  =  0  (37b) 

and 

m-.  b 

l  I  I  -My)  -  A0(y)  ♦  M  B  (y>  -  B  (y)  ]  |  6ytc}  1  =  0  (37c) 

where  8  =  B  =  Q  for  p  >  n-1.  Eq.  (37b)  snows  that  the  differentia!  equation 

(1)  will  be  satisfied  by  y.  Moreover,  as  regards  Eq.  (37ci,  suppose  first 
that  all  of  the  boundary  conditions  are  dynamical.  Then  the  €y^  at  the  end 
points  will  ail  be  independent  of  one  another,  and  Eq.  (i~cj  implies 

A  (a)  -  A  (c)  4  X  {  B  (a)  -  B  (a)  ]  =  0  (37d) 

P  f-  p  ? 

u  =  a,  fci  p  -  0.1 . m-1 

Eqs.  *37d)  may  be  considered  as  a  set  of  2m  linear  equations  in  the  2m 

unknowns  (contained  in  the  and  Ep)  y^(a),  fW.,  an<k<  2m  -  1.  Cne 

fki 

solution  is  clearly  that  in  which  the  y  1  (a)  have  the  values  given  by  the  dy¬ 
namical  boundary  conditions,  since  then  A  (a)  =  A  (a)  and  B  (&)  =  B  (a). 

P  P  p  p 

Moreover,  this  will  be  the  only  solution  of  Eqs.  (3'*d),  since  as  can  be  seen 
from  the  form  of  A  {vl  end  B  v  *J  in  Eq.  (3 '•  the  determinant  of  the  eoua- 

p  -  P  ** 

tions  will  not  be  zero.  In  case  some  of  the  boundary  conditions,  say  f  of 

them,  are  geometric  this  argument  may  be  modified  by  noting  that  there 

fc) 

wi.  now  be  f  fr-near)  relations  among  the  Sy'"  (a),  with  (2m-f)  of  the 
5y^^(a)  whir4*  will  be  independent  of  cne  another.  Hence  Eqs.  f37c)  wil’ 
now  yield  (2m-f)  linear  equations  in  {2m-f  4  derivatives  of  y  of  m'th  and 
higher  orders  at  the  end  points.  Cnee  again,  these  will  be  found  .a  have 
as  their  unique  solution  the  (2m-f  )  dynamical  boundary  conditions  of  the 
problem. 


Thus  the  variaHcnai  condition  6P  =  0  with  respect  to  2m -times 
differentiable  functions  satisfying  the  geometric  boundary  conditions  has 
been  shown  to  imply  satisfaction  of  Eq.  (11,  together  with  the  dynamical, 
as  well  as  geometric,  boundary  conditions.  It  follows  that  the  extremal 
solutions  y  will  oe  eigenfunctions  of  (l)-(3),  with  corresponding  eigenvalues 
X  =  R(y)  =  Rfy).  In  a  manner  previously  shown  for  the  corresponding  theorem 
oo  the  Rayleigh  quotient  (cf.  Eq.  (23)),  the  condition  6P=0  is  equivalent  to 
5  R  =  0.  Moreover,  in  a  manner  analogous  to  that  for  K,  the  energy  quotient 
R  can  be  characterized  by  minimum  and  max-min  properties.  In  particular, 
all  of  the  minimum  and  max-min  theorems  which  have  been  established  in 
Chapter  C  for  the  Rayleigh  quotient  R  remain  valid  when  R  is  replaced  by 
K  uis  replaced  by  <1  in  the  orthogonality  constraint  conditions  for  the  higher 
modes,  and  the  comparison  functions  need  only  satisfy  the  eeometric  boundary 
conditions. 


IV.  EXISTENCE  OF  MINIMIZING  FUNCTIONS.  INFINITE 
GROWTH  OF  THE  EIGENVALUES 


14.  Null  spectra;  continuous  spectra.  The  minimum,  and  max-min, 
theorems  which  have  been  proven  here  for  the  eigenvalues  and  eigenfunctions 
of  self-adjoint  problems  are  thus  far  all  conditional  in  nature  inasmuch  as 
it  was  assumed  that  certain  relevant  minimizing  functions  exist.  In  this 
chapter  it  will  be  proven,  under  suitable  sufficient  conditions,  that  these 
do  exist. 

In  actual  buckling  and  structural  vibration  problems,  conditions  are 
usually  automatically  sufficient  for  the  existence  of  minimizing  functions  . 

In  such  cases,  as  already  seen  in  Sect.  9,  the  eigenvalues  are  infinite  in 
number,  but  discrete  (denumerable),  with  a  lowest  eigenvalue  X..  Thus,  to 
one  accustomed  to  such  types  ox  problems,  and  to  such  a  set  (spectrum)  of 
eigenvalues,  the-  existence  of  minimizing  functions,  which  in  specific  cases 
would  be  equivalent,  e.g.,  to  the  existence  of  buckling  or  vibration  mode- 
sKapes,  might  at  first  appear  obvious,  and  hardly  in  need  of  proof.  It  will 
be  seen,  however,  that  the  existence  of  such  functions,  even  under  "  ordinary* 
sufficient  conditions,  is  mathematically  far  from  obvious,  and  is  indeed  one 
of  the  fundamental  and  more  difficult,  but  interesting,  aspects  of  the  theory. 
Moreover,  there  are  important  classes  of  problems  (occurring,  e.g.,  in 
physics)  involving  linear  differential  equations  in  which  the  spectrum  of 
eigenvalues  can  be  of  a  quite  different  nature  from  the  discrete  spectrum 
indicated  above.  This  may  occur  when  the  differential  equation  is  singular 
at  one  (or  both)  end  points,  or  when  the  domain  fa,  b]  is  infinite.  In  such 
cases,  the  spectrum  of  eigenvalues  may  still  be  of  the  "  customary"  type 
indicated  above,  but  it  may  al3o  be  of  a  variety  of  different  types,  depending 
on  the  differential  equation  and  on  the  boundary  conditions  (which  now  may 


even  be  less  in  number  than  the  order  of  the  differential  equation).  For 
example,  no  eigenvalues  at  all  may  exist,  or  the  spectrum  may  consist  of 
a  continuous  set  of  numbers.  In  the  first  case,  the  spectrum  may  be  called 
"  null”  ,  and  in  the  second  case  the  spectrum  is  called  "  continuous"  .  A 
simple  example  will  suffice  here  to  indicate  these  possibilities. 

Consider  the  ("  Euler -Cauchy"  )  differential  equation 

(x2y')'  =  Xy  (38a) 

with  the  domain  [0,  1]  for  x.  Eq.  (38a)  has  a  (regular)  singular  point  x  =  0, 
but  is  nevertheless  of  the  type  (1),  (3).  The  general  solution  of  this  equation 
is 

nl  n2 

y  =  Ax  +  3x  L  (38b) 

where  A  and  B  are  arbitrary  constants,  and  n^,  n^  are  the  roots  of  the 
equation 

n2  +  n  -  V  =  0  (38c) 

(a)  Suppose  the  boundary  conditions  are: 

y(0)  =  y(l)  =  0  (38d) 

Then  the  system  (38a),  (38d)  is  not  only  of  the  type  (1)  -  (3)  but  is  also 
self-adjoint.  It  will  be  readily  found,  however,  that  there  are  iio  values  of 
X.  for  which  a  non-trivial  solution  for  y  will  exist. 

(b)  Le;  the  boundary  conditions  now  be  only 

y(°)  =  0  (38e) 


n. 


!f  n2  = 


nj,  then  the  general  solution  is  y  =  x  (A  +  B  log  x). 
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Then  (considering,  for  simplicity,  only  the  real  eigenvalues  of  \)  it  will  be 
found  that  the  set  of  real  values  of  X  for  a  non-trivial  solution  is  X  >  0.  This 
is  a  continuous  spectrum.  (If  (38e)  is  replaced  by  y(0)  =  finite,  then  the  set 
of  real  values  for  X  will  be  X  >  0). 

The  above  illustration  indicates  the  possibilities  of  a  continuous 
spectrum  as  well  as  of  no  (non-trivial)  solution  at  all  (null  spectrum).  Thus, 
the  existence  of  minimizing  functions,  etc.,  appearing  in  the  theorems  of  the 
preceding  sections  cannot  mathematically  be  taken  for  granted  apriori,  but 
must  be  proven,  under  suitable  sufficient  conditions. 

The  type  of  proof,  based  on  minimizing  sequences,  to  be  given  here 
will  also  be  relevant  in  the  proof,  to  be  given  in  a  subsequent  report,  of  the 
convergence  of  the  Ritz-Galerkin  type  of  methods  in  all  modes. 

15.  Auxiliary  theorems  on  point  sets  and  functions.  In  the  proofs  to 
follow  a  number  of  well-known  theorems  on  infinite  sets  S  of  real  numbers 
will  be  used.  First  it  is  recalled  that  a  limit  point,  or  point  of  accumulation, 
of  S  is  defined  as  a  point  P  such  that  in  any  neighborhood  of  P  there  exist3 
an  element  of  S  distinct  from  P.  P  itself  may  or  may  not  be  an  element  of 
S.  The  following  theorems  are  now  noted. 

(a)  If  S  has  a  lower  bound,  then  it  has  a  greatest  lower  bound. 

(b)  If  S  has  a  greatest  lower  bound  B,thenB  must  have  at  least  one 
of  the  following  two  properties:  B  is  a  member  of  S,  or  B  is  a  limit  point  of 
S.  (The  proof  follows  almost  immediately  from  the  definitions  ). 

*  By  definition  of  greatest  lower  bound,  mere  will  be  at  least  one  element 
s  in  S  such  that  s  <  B  +  e  for  any  e  >  0.  If  B  is  not  in  S,  then  this  element 
s  must  be  distinct  from  B.  Hence  if  B  is  not  in  S,  B  must  be  a  limit  point 
of  S.  (This  does  not  preclude  the  possibility  that  B  is  both  in  S  and  also 
a  limit  point  of  S). 


(c)  If  S  has  a  limit  point  P,  then  there  is  a  sequence  of  elements 

e  in  S  such  that  lim  e  =  P. 
n  n— <*oo  n 

(d)  If  S  is  a  bounded  infinite  set,  then  there  exists  at  least  one 
limit  point  of  S.  (Bolzanc  -Weierstrass  t!  eorem). 

(e)  Aninfinite  sequence  {s  }  of  non-decreasing  real  numbers  with  an 

KL 

upper  bound  is  convergent;  its  limit  is  the  leastupper  bound  of  the  numbers  s  . 

(f)  An  infinite  sequence  (s^  }  of  non-increasing  real  numbers  with 
a  lower  bound  is  convergent;  its  limit  is  the  greatest  lower  bound  of  s^. 
Theorems  (a)  and  (c)-(f)  are  proven,  for  example,  in  Ref.  15.  In  view  of 
theorem  (c),  it  is  noted  that  the  Bolzano-Weierstrass  theorem  (d)  can  also 
be  stated  as  follows: 

(g)  If  S  is  a  bounded  infinite  set  then  there  exists  a  sequence  of  ele¬ 
ments  eR  in  S  which  converges  to  a  limit  point  of  S. 

In  addition  to  the  above  theorems  on  sets  and  sequences  of  numbers, 
several  general  theorems  on  functions  will  be  needed.  The  notation  of 
Eqs.  (4)  will  be  used, 

(h)  Let  M  be  any  linear  differential  operator,  and  let  u,  v  be  func¬ 
tions  of  a  class  for  which  the  M-norm  exists  and  for  which  M  is  symmetric, 
i.e.,  $  (u,  v)  =  $  (v,  u)  for  all  u,  v  of  the  class.  Then 

$>  (ru  +  sv)  =.  r2  $  (u)  +  3  $  (u,  v)  +  s2  $>  (v)  (39) 

where  r  and  s  are  any  constants.  Eq.  (39)  follows  readily  from  the  defini¬ 
tions,  Suppose  now  that  M  is  positive  semidefinite.  Then 

2 

(  $  (u,  v)  )  <  $  (u )  •  $  (v)  (40a) 

The  inequality  (40a)  follows  by  letting  s  =  1  in  (39)  and  considering  the  right 
side  of  (39)  as  a  quadratic  in  r.  Since  the  latter  cannot  be  negative  for  any 
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real  r,  it  must  either  have  no  real  roots  or  a  double  root.  This  require¬ 
ment  immediately  yields  (40a).  For  the  special  case  of  M  =  I,  the  identity 
operator,  (40a)  is  the  well-known  Cauchy-Schwarz-Bunyakowsky  inequality: 


(u,  v)  <  (u,  u)  •  (v,  v) 


(40b) 


(i)  Let  f.(x)  be  a  sequence  of  differentiable  functions  in  [a,  b], 
and  let  the  sequence  if  (x)  converge  uniformly  in  [a,  b]  to  a  function  p  (x). 
Moreover,  let  there  be  at  least  one  point  £  in  [a,b]  for  which  the  sequence 
f.(£)  converges.  Then  the  sequence  f.(x)  will  converge  uniformly  in  [a,  b] 
to  a  function  F(x),  such  that  F(x)  =  $  (x).  This  is  proven  in  Ref.  16(with 
series  instead  of  sequences),  and  is  related  to  differentiation  of  an  infinite 
series  or  sequence. 

b  2 

(j)  If  for  a  sequence  of  functions  u^fx),  f  *(x)  u^(x)  ^x 

a  b  ^ 

is  bounded,  and  f(x)  is  continuous  and  >  0  in  [a,  b],  then  J^Sf  J  u.^  (x)  dx 

a 

will  be  bounded.  This  is  readily  proven  by  noting  that  the  hypotheses  imply 

L  <  A,  and  also  f(x)  >  C  >  0  in  [a,  b], where  A  and  C  are  fixed  numbers. 
h  “  ~  b  . 

Hence  0  <  Jh  <  (l/ C)  J  f(x)  u^  dx  <  k!  C. 

CL 

(k)  Let  cp  (x)  be  a  given  piecewise  continuous  function  in  [a,  b], 
and  w(x)  an  arbitrary  function  with  a  continuous  (m+  l)th  derivative  in 
[a,  b]  satisfying  the  boundary  conditions 


w^(a)  =  w^v  ^(b)  =  0, 


v  —  0,  1,  . . . ,  m 


(41a) 


Suppose  that 


I  ®  w 


,(m+l) 


dx  =  0 


(41b) 


for  all  w.  Then  m  (x)  must  be  a  polynomial,  P  ,of  m'  th  or  lower  degree. 

m 

This  is  a  theorem  of  Zermelo. 
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Proof.  If  cp  (x)  =  P  ,  then  (41b)  is  satisfied,  since 


f  pm(x)  w^m+i^(x)  dx  =  0 


(41c) 


by  successive  integrations  by  parts,  with  the  conditions  (41a).  To  show 

Y 

that  m  (x)  cannot  be  any  other  function,  let  P  (x)  =  C  +  C,  x  +  . . .  +  C  x 
v  '  1  m'  o  1  m 

where  the  C.  are  determined  so  that 
i 

b  .  b  . 

J  x1  Pm(x)  dx  =  J  x1  ©  (x)  dx,  i  =  0,  1 . m  (41d) 


Eqs.  (41d)  are  a  set  of  (m  +  1)  linear  equations  in  the  (m  +  1)  unknowns 
Cq,  .  . ,  and  will  have  a  unique  solution  for  a  given  cp  (x).  For,  in  the 
special  case  in  which  cp  (x)  is  a  polynomial  of  m*  th  or  lower  degree,  the 
C  would  have  to  satisfy  (41d),  while  for  such  a  cp  (x)  not  identically  zero, 
the  right  side  would  be  non-zero  at  least  for  some  i.  Hence,  since  a 
solution  for  the  Ch  exists  in  that  case,  the  determinant  of  the  system  in 
that  case  cannot  vanish,  and  therefore  cannot  vanish  for  any  given  cp  ,  since 
the  determinant  is  the  same  for  all  cp  .  It  is  now  noted  from  Eqs.  (41b)  and 
(41c)  that  for  any  given  cp  , 


/  (CP  -  Pm)  c»(m  +  l>  dx  = 


(41e) 


for  all  w.  Moreover,  a  w(x)  exists  (i.  e.,  satisfying  (41a))  such  that 


W^n+1\x)  =  cp  (x)  -  Pm(x) 


The  required  w(x)  is  obtained  by  successive  quadratures  of  (41f)  each 

satisfying  the  boundary  conditions  at  x  =  a  in  (41a).  The  conditions  at 

x  =  b  will  then  be  automatically  satisfied.  For,  the  first  quadrature  yields 

w^m^(x)  =  f  [cp  (x)  -  P  (x)]  dx,'  by  vi'tue  of  (41d)  for  i  -  0,  (41a)  is  satis- 
a  b  ,  -  b 

fied  for  v  =  m.  It  is  next  noted  that  I  *  f  x  w  (x)  dx  =  x  w 

1  a  a 
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-  f  w^m^dx  =  -  J  w^m\'x)  dx.  But  (41d)  with  i  =1  and  (4If)  imply  Ij  =  0. 
Hence  vlm  ^(x)  =  J  w^m/(x)  dx  will  now  satisfy  (41a)  for  v  =  m  and  m-1. 

By  considering  I,  2  J  x  w  m  (x)  dx,  etc.  one  can  finally  show  that  w(x) 

3.  (lYX  ,  i  \ 

will  satisfy  all  the  boundary  conditions  of  (41a),  In  (41e)  now,  let  w  m  (x) 
be  given  by  (41f).  Then  this  will  imply  cp  (x)  =  P^fx)  • 

(1)  Let  u^(x)  be  an  infinite  sequence  of  functions  each  of  which  has 
continuous  derivatives  in  [a,  b]  through  some  order  p.  Moreover,  for  some 
q,  (  0  <  q  <  p),  let  there  be  an  interval  (x,,  x_)  in  [a,  b]  for  which  the  sequence 

v  I 


.(m). 


J  [u  (x)  ]  dx  is  bounded.  Then  there  will  exist  a  sequence  of  points 

Xj  h 

v  in  [a,  b]  such  that  for  each  v,  q  <  v  <  p,  the  sequence  u^(|^  is 
bounded.  This  is  an  important  theorem  here  which  can  be  proven  in  the 
manner  of  Ref.  13,  pp.  85-87,  although  the  theorem  is  not  explicitly  stated 
there  in  this  generalized  form. 

(m)  Finally,  the  theorem  of  Arzela,  also  known  as  the  theorem 
of  Ascoli,  must  be  noted.  This  is  an  extension  of  theorem  (g)  to  functions. 

In  stating  this  theorem  it  is  first  necessary  to  define  a  set  of  equicontinuous 
functions.  A  set  of  functions  f(x)  is  said  to  be  equicontinuous  if  for  every 
e  >  0  there  exists  a  6  >  0,  depending  on  e  but  not  on  the  particular  function 
f(x)  of  the  set,  such  that  if  |x^  -  x^  (  <  6(e  ),  then  | f (x^)  -  ffx^,)  |  <  e  ,  where 
Xj  and  x^  are  in  the  domain  D  of  the  independent  variable.  The  theorem 
of  Arzela  states  that  a  set  of  uniformly  bounded  and  equicontinuous  functions 
in  a  given  bounded  domain  D  is  compact,  i.e.  from  such  a  set  it  is  possible 
to  choose  a  sequence  of  functions  which  converges  uniformly  to  a  continu¬ 
ous  limit  function  in  D.  This  theorem  is  proven  in  Refs.  4  and  6. 


16.  Existence  of  a  minimizing  sequence  for  R.  An  *  admissible" 
function  u  for  the  energy  quotient  R(u)  will  henceforth  be  defined  as  one 
which  satisfies  the  geometric  boundary  conditions,  and  has  a  continuous 


(m-l)th  derivative,  and  a  piecewise  continuous  m*  th  derivative  in  [a,  b]. 

It  will  be  assumed  that  M  is  positive  semidefinite  and  N  positive  definite. 

Then  for  any  admissible  function  u,  $(u)  >  0,  i|i  (u)  >  0  and  hence  R(u)  >  0. 
Therefore  by  auxiliary  theorem  (a),  R(u)  has  a  greatest  lower  bound,  say 
B.  Consider  now  the  set  S  of  all  values  R(u)  for  admissible  functions  u. 

If  S  is  finite  then  the  function  for  which  R(u)  assumes  the  lowest  value  will 
be  the  minimizing  function,  and  the  existence  of  a  minimizing  function  for 
R  is  thus  proven.  Suppose,  then,  that  S  ic  infinite.  Then  from  auxiliary 
theorem  (b),  B  is  a  member  of  S  or  it  is  a  limit  point  of  S.  In  the  former 
case,  the  existence  of  a  minimiring  admissible  function  for  R  is  again  proven 
Hence  suppose  thatB  is  a  limit  point  of  S.  Then  from  auxiliary  theorem  (c)  t 
there  exists  a  sequence  of  values  of  R(u),  with  the  corresponding  sequence 
ju^l  of  admissible  functions  u^,  such  that 

lim  R  (u^)  =  B  (42) 

h  — *  co 

A  sequence  such  as  u.  is  called  a  minimizing  sequence  for  R  (u).  Note  that 
the  existence  of  such  a  sequence  has  been  proven  here  without  the  necessity 
of  specifying  how  such  a  sequence  is  to  be  constructed  in  practice.  (The 
actual  construction  of  such  sequences  forms,  in  fact,  the  basis  of  such 
methods  as  the  Rayleigh-Ritz  to  be  discussed  in  a  second  report.) 

The  existence,  just  proven,  of  a  minimizing  sequence  satisfying 
an  equation  of  the  type  (42),  does  not  per  se  necessarily  imply  that  an  ad¬ 
missible  function  U  exists  for  which  R  (U)  ~  B.  A  simple  illustration  of 
this,  due  to  Weierstrass  and  given  in  Ref.  8,  is  the  following.  Find  a 
continuous  function  y,  with  a  continuous  derivative  in  [-1,1  ],  which  satis¬ 
fies  the  boundary  conditions 


y  (-D  =  -  1  , 


y(l)  =  i 


(43a) 
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arid  for  which  the  functional 

.,*2  2 

I  (y )  S  I  x£  y'  c  dx 

-1 


(43b) 


is  a  minimum.  Since  I(y)  >  0,  I(y)  has  a  lower  bound,  and  hence  a  greatest 
lower  bound,  say  B,  >  0.  The  value  of  B  is  here  zero,  as  can  be  shown  by 
considering  the  set  of  functions  y  (x)  defined  by 


y^  (x)  =  arctan  (x/  e  )  f  arctan  (1/  e  ) 


(43c) 


where  e  is  a  positive  number.  These  functions  are  clearly  admissible 
here.  (They  satisfy  (43a)  and  have  a  continuous  derivative  in  [-1,  1]  ). 
Moreover, 

I  (y  )  <  f  (y?  +  e  “)  y’ 2  dx  =  2c  /  arctan  (l/e  )  (43d) 

e  '-1  e 

From  (43d)  it.  follows  that  I(y^  )  can  be  made  as  close  to  zero  as  desired 

by  choosing  a.  sufficiently  small  e  >  0,  Hence  greatest  lower  bound  of 

I(y)  is  B  -*  0,  and  a  minimizing  sequence  for  I(y)  would  be  y^  (x),  with 

(for  example)  c  =  1/  n,  n  =  1,  2,  . . ,  .  Nevertheless  there  does  not  exist 

any  admissible  function  y  for  which  I(y)  =  b  =  0.  For,  such  a  function 

2  2 

would  here  have  to  be  such  that  the  integrand  x  y1  is  identically  zero; 
the  only  such  function  is  y  =  const,,  which  cannot  satisfy  the  boundary 
conditions  (43a).  It  may  also  be  noted,  incidentally,  that  although  each 
function  y  (x)  of  the  minimizing  sequence  (43d)  is  continuous  in 
the  limit  Y(x)  of  this  sequence  as  e— »  0  is  a  discontinuous  function: 

Y(x)  =  -  1  for  -  1  <  x  <  0;  Y(C  )  =  0;  Y(x)  =  1  for  0  <  x  <  1.  (The  con¬ 
vergence  of  y^  (x)  to  Y(»)  is  non-uniform). 

It  will  now  be  shown,  under  certain  suitable  sufficient  conditions, 
that  an  admissible  function  U  for  which  R(U)  =  B  does  exist.  The  proof, 
based  on  applying  the  theorem  of  Arzela  to  a  minimizing  sequence  for  R(u), 
will  be  along  the  lines  of  Rtf.  13. 


17.  Assumptions.  The  problem  defined  by  Eqs.  (1)  -  (3)  is  assumed 
self-adjoint,  and  F(x)  and  gv(x)  are  assumed  to  have  continuous  derivatives 
in  [a,  b]  through  ore  Moreover,  it  is  assumed  that: 

fm(x)  >  0  ;  fv(x)  >  0,  gv(x)>  0  (v  =  0,  ...,m-l) 

fDW  +  80(x)  ^  0  »*  M0(u)  >  0  **  Nq(u)  >  0  (44) 

(u)  >  0  ,  $  (u)  >  0 

where  u  is  any  admissible  function  not  identically  zero.  These  assumptions 
have  all  been  grouped  together  for  convenience.  They  are,  of  course,  not 
entirely  independent  of  one  another.  In  particular,  the  assumptions  on 
f^fx),  g^(x),  Mq(u)  and  Nq(u)  imply  ^  (u)  >  0,  4  (u)  >  0.  Here  it  is  assumed 
in  particular  that  vp  (u)  >  0,  l.  e,  N  is  a  positive  definite  operator.  The  con¬ 
dition  fQ(x)  +  gQ(x)  ^  0  can  be  relaxed,  as  will  be  indicated  subsequently. 
Although  the  assumptions  (44)  may  perhaps  at  first  sight  appear  somewhat 
restrictive,  it  is  noted  that,  among  other  possible  physical  situations, 
they  are  typically  satisfied  in  a  variety  of  actual  structural  buckling  and 
vibration  problems. 

From  (u)  >0,  $  (u)  >  0  it  follows  that  R  (u)  >  0,  and  therefore 
that  R  (u)  has  a  greatest  lower  bound,  B.  As  shown  in  Section  16,  there  will 
hence  exist  a  minimizing  sequence  of  admissible  functions  u^(x),  h  =  1,  2,  .  . . 
for  R(u).  Since  41  (u^)  >  0,  the  functions  u^(x)  may  be  normalized  so  that 

4?  (uh)  =  1  (45a) 

Then  from  Eq.  (42), 

lim  $  (u^)  =  B  (45b) 

h  *”•’  oo 

The  nature  and  convergence  of  the  sequence  u^(x)  and  of  the  sequence,  of 
derivatives  of  u^x)  will  now  be  analyzed. 


follcwinp  statement  will  first  be  pr cved.  fil  *V-r  every  0  <  ■»  <  m-i  ami 
every  h,  there  is  a  point.  ^  in  U,b;  so  that  the  sequenc t  o!v^  ),  h-1.2. 

is  SKJCoiv-d.  rh:s  ti  mover  by  first  r.o».ici  from  j3-»b.  c>.  (-*41  and  ffSa.b: 

that  *  ft  3  ^  3u!"  di  miT-st lx  bcasaei.  Hence,  since  :  *j  >0.  I  cii  *r  )c 
a  *t 

must  be  bourc^d.  where  .  r.^)  is  any  interval  in  *a.b>.  From  assumptions 

(44)  there  must  exist  an  interval  (x,  .  x.l  in  f  a ,  bl  far  vhich  i  |x)*  *  (x)  >0. 

12'  o  © 

.*2  > 

C(cs«jwr.ttr,  from  the  ia.'iiiary  theorem  f;l  of  section  15,  (  «bc  sri»l 

-Ttj  h 

be  bounded  Statement  (1)  then  follows  from  the  auxiliary  theorem  f  f  >  with 
p  -  m*  1,  q  -  3  there. 

With  the  use  at  3Wtr.nest  fl)  it  *  an  be  shown  that:  J?) For  each  v, 

(vl 

0  <  v  <  m~l.  the  sequence  u  ^  (x)  (h  -  1, 2. _ >  is  acifomiy  branded  in 

U,  tor. 


frc-1)  ,  .  (m-i)  .>  ,  ,  .•  ,  , 

n  fx)  *  u.  It-  .)+  f  ..(m) 

t  n  6,  m-t  -  .  \  « 

*h.  m-i  & 


(46) 


(mj 


,  c 


But  iroin  (54b).  {4jdI,  and  assumptions  (44),  the  sequence  f  )  dx 

« 

must  be  bounded;  hence,  since  f  (x)  >  0  and  is  ccudsucus  in  [a,  b]»  the 

b  2  ^ 

fm' 

sequence  “  )  dx  must  also  be  bounded  (auxiliary  theorem  'dj. 

Moreover,  fiosn  (40b), 

b  ,  ,  b  .  2  12 

{*  i*Km>!/»x  <  [  (b-a)  f  (*£*>)  dx] 

a  a 

f . 

’n tr.ee  t  ’u/  '  ;dx  must  be  bounced.  It  then  follows  from  Eq.  (4b),  ir. 

a  .  (m-1) 

conjunction  witr.  statement  (1).  that  the  sequence  u.  *  (s)  is  uniformly 

bounded  in  fa,  bf.  From  Eq.  (46)  with  n:  successively  neve  replaced  by 

(m-I),  (m-2),  etc.,  statement  (2)  is  readily  seen  to  follow  for  each  of  the 


v  indicated. 


o’  tv 


It  will  awt  be  worm  t*«:  II)  For  each  t. 


M 


?  <  t  <  rrt-U  the 

$«jvafe  a'^'(x)  is  e^s.castiKws  h  (a.,  b}.  For  0  <  »  <  m-2.  statement 
€31  follows  readily  be  observing,  from  statement  (2)  that  for  each  sech  . 


there  wiil  be  a  positive  C^.  tac^pradect  of  h,  such  that  for  any  x, 


and  X. 


m 


Ja.b|,  *  4 -  ’  !  f  2  <ix  j  <  ?x>-x.?C  . 


The*,  taking  6(e)  <c  f  C^,  the  equicontmuity  defisitioo  (giver.  is  tcnlian 

(r) 

theorem  Cm))  is  seen  to  be  satisfied  for  the  setjuence  u.Q  (x).  0<  •  <  ja-2. 
For  r  =  rn-I,  it  is  first  observed  that  since  f  (x)  >  0  and  is  coctiarxjs  in 
[a,  b],  there  is  a  positive  T>  such  that  D  is  [»,  b],  Hence,  u»int  also 


(40b). 


,  (m-i)  (m-ll  ,2 

lUh  (x2*  *  ua  <*!>  5 


(  fZ  1 


ax  1 


.  |  ,  I  ,'*2  f  Cm).2.  !  ^  .  ,1  ■* .  .  (m)  2  , 

<  l^-x,!  |  j  ^  la,  !«V  V  5-  J  *r>h  ?  <** 


t 

_<  I  *j  •  *  (a^?/  D.  from  (34o)  and  (44).  Since  by  (45b), 
f  {a^l  is  bounded,  statement  (3)  is  thus  seen  to  hold  for  v  -  m  -  l. 

Che  can  now  prove  the  following:  (4)  For  a  suitably  chosen  sub¬ 
sequence  (which  nay  again  be  denoted  by  hi  of  the  indices  a,  tse  functions 

S  j- j 

u^(x-  and  their  derivatives  u^  *  (si,  0  <  >  <  m-1.  converge  uniformly  to 

fv) 

an  admissible  function  U(x)  and  to  V*  (x>,  respectively,  where  t*(x)  fear 
continuous  derivatives  through  order  (m-I)  in  [i.b],  and  satisfies  the 
geometric  boundary  conditions.  This  statement,  which  is  the  main  result 
of  this  Section,  follows  by  first  observing  from  statements  (2)  and  (3), 
and  the  theorem  of  ArzeU  (auxiliary  theorem  (m))  applied  for  -  m-1, 
that  a  subsequence  {which  may  stili  be  denoted  by  h)  of  the  indices  h  can 
be  chosen  so  that 


lim 

h  -»  co 


%  U’ 


U  .(x) 
rr.-l 


(4?) 


-44- 


'■h**re  V_  ,?5»  is  a  c  cr.tinvvU'*  ..tier,  in  [s.  bl.  Moreover,  from  statement 

?2)  appliec  !c  u._  -a)  ir.u  :r<c:  auxiliary  tnc-rrm  (g)  it  follows  that  one  can 

(vl 

choose  the  above  subsequence  so  that  *a  of  :r«  sequences  '{a), 

0  <  <  rr-I.  converses.  Tr-rr  auxiliarr  theorem  (i)  vith  f.(x)  =  u '  *  '{xl, 

—  -  }  h 

ar.d  i  =  a.  ar.c  from  the  uniform  ccewrjefice  in  (47),  it  then  follows  that 

fm- 

the  sequence  u  ’  "(Vi  is  uniformly  convergent  in  [a,  b],  with  a  limit  func¬ 
tion  U  ,(xl  such  IT  '  (x)  =  V  ,[x).  Application  of  auxiliarv  theorem 

m— .  m — _  m— i 

(i)  now  to  the  sequence  u^m  ^(s)  shows  that  the  sequence  Uj|m *^(x)  will 

converse  uniformlv  to  a  function  U  ,(x),  where  U  .(x)  =  U  ,(x). 

rr.-j  m*j  m-t 

Further  successive  application  of  auxiliary  theorem  (i)  thus  shows  that 

(r) 

u  (x>  converges  uniformly  to  a  limit  function  U^(x),  where  for  0  <o  <  m-2, 

V*  (x)  -  U  .  Jx).  Dtaodisc  U  (x)  hv  Ufx),  it  is  seen  that  statement  (4)  will 

v  rri  '  O 

readily  follow.  The  fact  that  U{x)  will  satisfy  the  erometric  boundary  con¬ 
ditions  (which  involve  only  0  <  v  <  m-1 1  fellows,  of  course,  from  the 

rest  of  statement  (4)  just  proven,  and  from  the  fact  that  each  u^(x)  satisfies 
these  conditions.  Statement  (4j  and  Eq.  (45a)  imply  that 

S  (U)  =  1  (48) 

i°.  Differentiability  of  U(x)  through  2m*  th  order.  The  function 
V(x)  obtained  in  Section  IS  will  now  be  shown  to  have  continuous  derivative  , 
in  la,  b;  through  order  2m.  Differentiability  of  Ufx)  beyond  the  (rr:-l)th 
order  most  be  proved  for  at  least  two  reasons  Firrt,  in  order  to  shew 
that  R  (U)  =  3,  it  will  be  necessary  to  show  that  the  derivative  of  at  least 
m*  th  order  cf  U  exist*  in  •  a,  bj.  Second,  it  wiii  then  be  desired  to  apply 
the  results  of  Section  13  to  show  that  U  must  be  a  comparison  function 
minimizins;  tr.e  Rayleigh  quotient  R.  The  analysis  of  Section  13,  however, 
assumes  the  2m’ th  differentiability  of  U. 

The  following  statement  (A)  will  first  be  proven. 


A,  Let  v  ix)  be  a  secucn  e  of  admissible  functions  for  which  4  (v  l 
a  h 


and  „  (v  )  are  bounded,  and  for  which 
h 


*  (uh*  v  =  0 


(49a) 


Then 


lim  4  (u,  ,  v,  i  -•  j 

.  h  r. 

i.—*  or 


(49n. 


Proof:  It  is  noted  (Eqs.  (34f )  )  that  for  admissible  functions  u.  v  and 

self-adjoint  boundary  conditions,  f (u,  v)  =  4  (v,  uj,  o  (u,  v)  -  ;  (vtu),  Hence 

Eq.  (39)  will  hold  fer  4  and  o  .  Let  now  and  s^  be  any  chosen  sequence 

of  constants.  Then  w,  =  r,  u.  +  s,  v,  will  be  admissible  functions,  and 

h  h  r.  h  h 

hence  4  (w^)/i  fw^)  B.  From  Eqs.  (39)  for  4  and  4>,  (45a),  and  (4<a), 
this  inequality,  letting  >  0,  is  found  to  imply 

2  (  *(uh,  vh)  *  (  i)2(  *(^n)  -B]  >  -  [  4  (vh)  -B  I  fvh)  ] 

h  h 

(49c) 

This  must  hula  for  all  h,  r^  and  s^.  It  xs  now  observed  that  b>  the  hypothesis 

the  entire  quantity  on  the  right  side  of  (4<>c)  will  remain  bounded  as  h  —  co, 

and  be  negative  or  zero.  Moreover,  although  (4  (u^l  -  Bi  wil  be  non- 

negative,  it  will  approach  zero  ash  —  cc  fEq.  (45b)).  Consequently,  in 

order  that  (4Sc)  hold  as  h  —  co  for  arbitrary  (r.  /  s.  ),  Ec.  (49b)  must  held. 

n  n 

For  otherwise,  for  some  sufficient!}’  large  one  could  always  find  an 
(r^  /  s^)  of  opposite  sign  to  *  (u^,  v^)  in  ruch  a  way  that  (4Qc)  would  be  violated 
With  the  aid  of  statement  (A),  the  following  will  be  proven: 

(B)  For  every  admissible  function  w(x). 


*  (u^,  w)  -  B  0  (u^,  w)  —  C  as  h  —  co. 


(4Pd) 


To  prove  this,  let 


■\  -  (u^,  w) 


(49el 


-46- 


Then  v,  will  be  an  admissible  function,  Eq.  (39)  may  therefore  be  applied 
h 

to  #,  to  yield 

*  (v^)  =  Z  (w)  -  zZ  (u^,  w)  *  (u^,  w)  +  <l>  **  (u^,  w)  *  (>i^)  (49f) 


From  (45a,  b),  *  (uh)  and  «l»  (v^)  will  be  bounded  as  h  —  co  ;  hence  from 
(40a),  *  (u^,  w)  and  i  (u^,  w)  will  be  bounded  as  h  -  od.  Therefore  (49f) 
implies  that  also  *  (vh)  will  be  bounded.  From  (39)  applied  to  +(vh),  and 
from  (45a),  it  follows  that  4*  (v^)  =  di  (w)  -  w).  Hence  the  4»  (vfc)  will 

be  bounded.  It  is  finally  noted  from  (49e)  that  4*  (u^,  v^)  =  4>  (u^,  w)  -  «|i  (u^,  w). 
Z  (uh)  =  0.  Therefore  statement  (A)  can  be  applied  here,  to  yield 

#  v^)  —  0  as  h  —  oo.  With  vfa  given  by  (49e),  this  yields  *  (u^,  w)  - 

*  (u^)  Z  (u^.  w  )  —  0,  which  in  view  of  (45b)  yields  (49d). 

It  can  now  be  shown  that  U(x>  has  a  continuous  2m‘th  derivative  in 
[a,  b).  Let  w(x),  as  in  the  auxiliary  theorem  (k)  of  Section  15,  be  an  arbitrary 
function  with  a  continuous  (m  +  l)th  derivative  in  [a,  b],  satisfying  the  bound¬ 
ary  conditions  (41a).  For  such  a  function  («h,  w)  =  0.  ^(u^w)  =  0, 

and  m  b  n 

♦  (u^  w) -B*  (u-n.  w)  =  f  (  £  ^  fy  Ubv)  w{v))dx  -  B  gy  ^V>w(v))dx 

(49g) 

i.f  jv  (3 4- ) ).  By  integration  by  parts,  each  term  on  the  right  side  of 
(49g)  can  be  transformed  so  that  it  will  have  a  factor  of  w 
purpose,  as  in  Ref.  13,  let 

fv  u<">  3  V„, ;  *  G'C1  <V 


(m+1) 


For  this 


I  Fv  (u^  dx  S  Fl  5  (u.J  ,* 


n' 


X  Clk'11  :“h>  *  *c!vk,,V 


(49h) 


Note  that  for 


k  >  0  the  F^k'(u,  )  are  essentially  integrals  (k  =  1)  or  iterated 
v  h 


(  v) 

integrals  (k  >  1)  cf  f  .  Ircegrating  by  parts,  and  using  conditions 
(41a),  one  then  finds 

/b  fv  »(v)  dx  =  <-l)m+1  f“  F(m'v+l1  (uh)  Jn,+1>  d,  (49i) 

a  v  '  a  v 

and  a  similar  equation  with  f  and  F  replaced  by  g  and  G,  respectively. 

From  statement  (B)  (see  Eq.  (49d)),  the  left  side  of  (49g)  approaches 
zero  as  h  —  oo,  and  hence  from  (49i), 

lim  fb  |  f  Ffm-v+1l  <u.  )  -  B  ?  G  [m-v+l]  j  Jmfl)  = 
h  —  oo  a  » v=0  v  n  vr0  v  h  » 


(49j) 


For  the  term  with  v  =  m,  it  is  noted  that 

x 


_  [1]  ,  v  r*t  (in).  -  (m-1)  ,X  fX  ,  (m-1)  . 

Fm  'Vs/  'm^  ‘  "h 


a  a 


(49k) 


Consequently,  it  is  seen  that  on]’-  u^  and  its  derivatives  through  (m-l)th 

order  appear  in  (49j).  Hence  in  view  of  statement  (4)  of  Section  18,  one  may 

replace  u^  ly  U  in  (49 j)  in  passing  to  the  limit,  and  infer 

u  m  n  r  ,  i 

f  i  T  F  tm-v+"  (U)  -  B  T  C  (U)  dx  =  0 

J .  I  *~i  n  V  ry.  V  1 

a  fv-0  v-0 

Applying  the  auxiliary  theorem  (k)  of  Section  15,  it  now  follows  that 


m 


T  pfm-vrlj  (lJ)  _  B  \  Gfn.-v+ll  ^  _  p  , 

vf0  v  v  m 


v  =  0  v 


(490 


where  ?m(x)  is  a  polynomial  of  degree  m  or  less.  Hence  the  left  side  of 

(4tl)is  (continuously)  differentiable  m  (and  more)  times. 

Recalling  from  statement  (4)  of  Section  18  that  U  is  (continuously) 

differentiable  at  least  through  order  (m-1),  it  is  noted  that  each  individual 

term  on  the  left  side  of  (49f )  for  which  0  <  v  _<  m-I  is  differentiable. 

F]  1 

This  includes  all  terms  in  (49f )  except  the  single  term  Fj^  ’  (V),  which 
must  therefore  also  be  differentiable.  But  by  (49k),  noting  that  its  right 


/_-  _  l  \ 

side  involves  only  u,  ,  it  follows  that 

n 


^  (U)  =  f  u(m-r 


j  t'  U(m  ^  dx.  Since  the  integral 

^  _  1  / _  i  % 


here  is  differentiable,  it  follow's  that  f  Um  \  and  hence  that  U^111  is 

m 

differentiable.  Thus  U  must  have  continuous  derivatives  in  [a,  b]  through 
at  least  m'  th  order. 

To  prove  U  differentiable  beyond  the  m'  th  order,  it  is  noted,  from 


the  definitions  (49h),  that  by  differentiating  (491)  now  once,  one  obtains 


m-l 

f  U(m)+  V 
m  v 

v  =  0 


(U)  -  E  )  g  [m-vl  (U)  =  p 

v  m-l 


(49m) 


Each  individual  term  beyond  the  first  on  the  left  side  of  Eq.  (49m)  is  dif¬ 
ferentiable.  Hence  the  first  term  must  also  be  differentiable,  and  this 
readily  implies  U'  1  differentiable.  By  successively  differentiating  (49m) 
now  one  can  similarly  conclude  successively  that  ....  are 

differentiable,  until  one  finally  concludes  that  U  has  continuous  derivatives 
in  [a,  b]  through  order  2m. 

20.  Existence  of  minimising  functions  for  R  and  R.  It  will  now  bo 
shewn  that 


R  (U)  =  B 


(50a) 


Since  U  is  admissible,  it  follows  from  (49d)  that 


(u^,  U)  -  B  i  (u^,  U)  —  0  as  h  —  oo 


(50b) 


In  passing  to  the  limit  here,  statement  (4)  of  Section  18  implies  that  one 

may  replace  u,  by  U  on  the  left  side  of  (50b)  in  all  terms  other  than  the 
n 

(rr\  1 

single  term  containing  u^  .  This  term  is 


ta)  (m)  (m-l)  u<m) 

rr:  h  h  m 


f  U(m))*dx 

J  m  h  m 

(50c) 


One  may  now  consider  the  limit  of  the  right  side  of  (50c)  as  h  oo,  and 
note  that  here  one  may  replace  by  U  the  limit.  The  result  then  be¬ 
comes  identically  f  U^m'  dx,  which  is  equivalent  to  replacing  u^ 

a 

by  U  on  the  left  side  of  (50c).  It  follows  then  that  (50b)  implies  $  (U)  -  B  (U) 

=  0,  which  is.  equivalent  to  (50a),  since  4»  (U)  =  1. 

Thus,  it  has  been  proved  that  the  function  U(x)  is  indeed  a  minimiz¬ 
ing  admissible  function  for  R  (u).  It  has  also  been  proved  that  this  function 
has  continuous  derivatives  through  order  2m  in  [a,  b].  Hence  from  Section 
13,  it  follows  that  U  will  also  be  a  comparison  function  satisfying  (l)-(3),  and 
hence  an  eigenfunction  of  (l)-{3)  corresponding  to  the  lowest  eigenvalue 
=  B.  Thus  U  is  also  a  comparison  function  minimizing  the  Rayleigh 
quotient  R.  The  derivation  given  here  shows  that  when  the  energy  quotient 
R  is  minimized, this  eigenfunction  can  be  obtained  from  a  minimizing  sequence 
of  admissible  functions  which  need  only  satisfy  the  geometric  boundary  con¬ 
ditions,  and  have  a  continuous  (m  -  l)th,  and  piecewise  continous  m' th 
derivative  in  [a,  b].  The  existence  of  relevant  minimizing  functions  for  R 
(and  thenc  s  for  R)  for  the  k’  th  mode  can  be  similarly  proven,  with  the 
functions  u^(x)  now  required  to  he  orthogonal,  in  the  sense  of  di  (u^,  y^), 
to  the  first  (k  -  1)  eigenfunctions. 

It  is  noted,  finally,,  that  in  assumptions  (44)  the  condition  f  +  0, 

which  is  used  only  in  the  proof  of  statement  (1)  of  Section  18,  can,  if  necessary, 
often  be  relaxed.  It  will  suffice  to  Hlustrate  this  with  a  differential  equation 

ii 

of  the  form  (  p(x)  vn  )  -*■  X  y"  =  0,  p(x)  >  0.  Such  an  equation  pertains, 

* 

for  example,  to  the  static  buckling  of  a  column.  Clearly,  the  condition 

f  +  g  a£0  does  not  hold  here.  It  is  noted,  however,  from  the  remainder 
o  o 

of  assumptions  (44)  and  from  Eqs.  (34b,  c)  and  (45a,  b)  that  in  general,  re- 

k  2 

gardless  of  f  and  g  ,  J  gj(x)  u*^  dx  must  be  bounded  as  h  —  oo.  In  the 

a 
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2 

present  example,  g^v)  =  1;  hence  f  u' ^  dx  must  be  hounded.  Therefore, 

a 

from  auxiliary  theorem  (/)  of  Section  15  with  q  =  1,  statemert  (1)  of  Sec¬ 
tion  18  remains  valid  at  least  for  1  <  v  <  m  -  1.  If  now  (as  would  usually 
be  the  case  in  the  present  illustration)  one  of  the  boundary  conditions  re¬ 
quires  that  y  itself  vanish  at  x  =  a  or  b,  then  statement  (1)  will  hold  also 
for  v  =  0,  and  thus  for  all  0  <  v  <  m  -  1,  as  previously.  If,  on  the  other 
hand,  y  is  not  prescribed  to  vanish  at  an  end,  but  is  included  in  the  boundary 
conditions  and  appears  in  or  Nq,  then  from  the  assumptions  0,  Nq-  0 

and  the  other  assumptions  in  (1-1),  in  conjunction  with  (45a,  b),  it  maybe  readily 
inferred  that  y  itself  must  remain  bounded  at  an  end  point,  so  that  state¬ 
ment  (1)  is  /alid  also  for  v  =  0.  Finally,  if  the  boundary  conditions  do  not 
involve  y  at  all,  then  for  self-adjointness,  y  will  not  appear  in  either  Mq 
or  Nq  (for  admissible  functions),  and  y  itself  will  hence  not  be  relevant. 

The  existence  proof  can  then  be  carried  out  for  v  >  1,  to  show  that  there 
will  exist  an  admissible  function  U*  (x)  for  which  R  attains  its  greatest  lower 
bound  (minimum). 

21.  Unbounded  growth  of  the  eigenvalues.  When  sufficient  condi¬ 
tions  such  as  (44)  hold  so  that  the  minimizing  functions  for  R  or  R  ,  appear¬ 
ing  in  the  theorems  of  Sections  8,  9  and  13  exist,  then  as  seen  in  Section  9, 
there  will  be  an  infinite,  but  discrete,  set  of  eigenvalues  of  (l)-(3)  which 
can  be  ordered  in  a  non-decreasing  sequence  <  X.^  <  ...  .  Henceforth 
under  such  conditions,  the  system  (l)-(3)  will  be  called  w  discrete*  .  It  will 
now  be  shown  that  for  such  systems  the  eigenvalues  are  unbounded  as 
k  —  ao,  i.  e. 

lim  =  oo  (51) 

k  —  oo 


This  can  be  proven  by  supposing  the  contrary  to  hold.  Then  the 
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X^  would  have  an  upper  bound, and  from  auxiliary  theorem  (e)  of  Section  15, 

lim  X,  *  A  (52a) 

k  —  oo 

\ 

where  A  is  a  finite  number.  Let  be  the  corresponding  normalized 
eigenfunctions.  Then,  since  H  (y^)  *  X^, 

T  (yk)  =1.  lim  «(yk)=  A  (52b) 

k  —  oo 

Thus  the  y^would  satisfy  the  same  conditions,  namely  admissibility  and 

Eqs.  (45a,  b)  (with  B  replaced  by  A)  as  the  u  functions  of  Section  18. 

h 

Following  now  the  proof  (statements  (l)-(4))  in  that  Section,  it  is  then 
clear  (under  assumptions  (44))that  from  the  y^  one  could  form  a  subsequence 
of  functions,  that  may  still  be  denoted  by  y^,  which  would  converge  uni¬ 
formly  in  [a,  b]  to  an  admissible  function  Y  (say )j  moreover  the  y  ^ 

1C 

would  converge  uniformly  toY^V\  0  <  v  <  m  -  i.  From  this  it  follows 

that  y^  v)(x)  -  y^  (x)  r-  0  in  [a,  b]  as  p,  q  —  oo  ;  hen  ce,  since  n  <  m-1, 
r  4 

lim  4?  (y  -  y_)  *  0  (52c) 

p,  q  —  co  P  * 

Since  Eq.  (39)  holds  for  vjj  ,  however, 

+  (y  -  yq)  =  (yp)  -  z*\>  (yp,  yq)  +  +  (yq)  (52d) 

Hence,  since  i|<  (y. )  8  <1*  (yQ)  8  1  and  4>  (y  ,  y  )  =  0,  it  follows  from  (52d) 

P  H  P  9 

that  in  general,  for  any  two  orthonormal  eigenfunctions  yp  and  y  , 

+  (y  -  vq)  =  2  (52e) 

This  contradicts  (52c),  and  hence  contradicts  the  assumption  that  (51)  is 
false.  Thus  (51)  is  proven.  It  may  be  noted  that  (51)  implies  that  any 
repeated  eigenvalues  can  each  have  only  a  finite  multiplicity. 
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V.  COMPLETENESS,  CLOSEDNESS  AND  EXPANSION  THEOREMS 

22.  Bessel1  3  Inequality  and  Parseval*  s  Equation.  Let  A  be  a 
positive  definite  linear  differential  self-adjoint  operator,  and  let 
q?  2»  •  •  •  be  3  se^  comparison  orthonormal  functions  with  respect 
to  A,  i.e.,  satisfying 


(A  cp  cp  j)  =  (cp  A  -j;  .)  =  5.j  (53a) 

co 

Then  the  series  c-  cp  .(x)  will  be  said  to  converge,  in  the  norm  of  the 
i=  1  1 

operator  A,  to  a  function  f(x)  in  fa,  b]  if 


lim  J  |f  -  u  I  |2  =  lim  J  (f  -  u  )  A  (f  -  u  )  dx  =  0 


(53b) 


oo 


oo  a 


where 


Vx)  a  Vi(x) 


(53c) 


^  2 

The  quantity  e  a  |  |  f  -  u  |  |  .  may  be  regarded  as  the  mean  squared 

r  p  n 

error  in  [a,  b],with  the  operator  A,  of  the  finite  series  representation, 

P 

i=ViTi,C;  f(x/.  When  A  =  I,  the  identity  operator,  f e^  |  is  the  "norm" 
of  (f  -  u  ),  and  the  subscript  I  (in  place  of  A)  is  then  omitted  in  the  notation 
(cf.  Section  2).  Moreover,  one  then  simply  speaks  *  convergence  in 
the  mean*  of  u  to  f  without  mention  of  an  operator.  For  the  more  general 

g 

operator  A,  the  terminology  ’'energy  convergence*  may  be  used  to  mean 

that  (53b)  is  satisfied;  [e  |  may  then  be  called  the  "  energy  norm"  ,  or 

P 

"A-norm"  of  (f  -  u  ). 

P 

A  further  condition  which  will  be  imposed  on  the  operator  A  here 

g 

is  that  it  be"positive-bounded-below"  ,  also  called  more  simply  "  strictly 


positive"  .  This  means  that  if  u  is  any  comparison  function,  then 
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l|u|(A  >  V  I  |  u  |  | 


(53d) 


where  y  is  some  fixed  positive  constant,  and  where,  in  accordance  with 

the  notation  herein,  (see  also  Section  2),  |  |u  |  |*  (Au,  u)1  ,  j  [  u|  |  =  (u,  u)1  . 

A 

The  following  interpretation  of  (53d)  is  particularly  pertinent  here.  Con¬ 
sider  the  eigenvalue  problem  Ay  =  Xy,  with  associated  boundary  conditions; 

A  is  assumed  self-adjoint  and  positive  definite.  Suppose,  in  addition,  that 
the  spectrum  of  eigenvalues  is  discrete.  Then  there  will  be  a  lowest  eigen¬ 
value  Xj  >  0,  which  will  be  the  minimum  value  of  the  Rayleigh  quotient 

R(u)  =  (Au,  u)/  (u,  u)  =  |  (u  |  | ^  /  J  Ju  |  |2.  Hence  |  |u  J  /  f  |u  |  [2  >  Xj, 

l/  2 

whence  (51d)  holds  with  y  =  X^  .  Thus,  if  A  is  positive  definite,  self- 
adjoint,  and  the  eigenvalue  problem  Ay  =  Xy  has  a  discrete  spectrum, 
then  (53d)  will  hold.  An  important  consequence  of  (53d)  is  that  under  this 
condition,  convergence  in  the  norm  of  A  will  imply  convergence  in  the  mean. 


This  follows  readily  by  noting  that  if  f  is  a  comparison  function,  then  so 

is  f  -  u  ;  hence  from  (53d),  0  <  |  jf  -  ii  |  |  <  (l/  y)  |  |f  -  u  }  |  .  Therefore 
p  p  p  A 

I  !f  “  UP I  Ia~  0  ^p1168  I  lf  -  up  1 1  —  0  as  p  —  co. 

If,  for  all  f  of  a  given  class,  a  set  cf  constants  c^  (depending  on  f) 
exists  so  that  (53b)  holds,  then  the  set  of  functions  cp^  is  said  to  be  com¬ 
plete,  in  the  norm  of  A, for  such  functions.  By  formally  writing 
f  =  Yj  Cj  <p  y  operating  with  A  on  both  sides  term  by  term,  multiplying 
bytp^,  integrating,  noting  Eqs.  (53a)  and  assum'ng  also  (cp  .  Af)  =  (f,  A<p  ^), 
one  obtains 


Cj  =  J  f  A  (cp  .)  dx  *  (f,  A  5,  .) 


The  quantities  (f,  A  <p  .)  are  called  the  (generalized)  Fourier  coefficients 
of  the  function  f.  Denoting  cu  =  (f,  A  $  it  is  readily  found  that  for  any 
set  of  c^, 
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'p*  <f-  Af>-2.i .  ci  °i  *  x c? 

1  1  =  1  i=j. 

p  ,  p  , 

=  {£,  Af)  -  ^  Qj  +  ^  (Cj  -  a.)2 


(55a) 


From  (55a)  it.  follows  that  the  mean  squared  error  will  be  a  minimum 

with  respect  to  the  c.  (for  given  f  and  $  .)  when  c.  =  a..  Thus  the  Fourier 

coefficients  are  the  values  of  the  c.  for  which  e2  is  minimized.  More- 

1  P 

over,  when  c.  is  given  by  Eq.  (54),  (i.  e»,  c.  =  a.),  it  follows  from  (55a) 


that 


Since  e 


2 

P 


c2  =  (f,  Af)  -  e2 
i  '  p 

for  any  p,  it  follows  that 


a> 

)  c 2  <  (f,  Af) 

i^l  1 


(55b) 


(56) 


(56)  is  known  as  Bes^l' s  inequality,  and  shows  that  in  general,  if  (f,  Af) 

co 

is  bounded  and  the  c.  are  the  Fourier  coefficients  of  f,  then  /,  c.  con- 

1  Z  i*1  1 

verges.  Moreover,  from  (55b)  it  follows  that  lim  e  =  0  if  and  only  if 
°  p— co  p 


oo 


.Z,  s2  --  <f- A£> 


(57) 


Eq.  (57)  is  knovn  as  Parseval1  s  equality,  or  the  completeness  relation. 

It  is  seen  to  be  equivalent  to  the  completeness, in  the  A-norm,of  the  set 
of  functions  m  . . 

i 

The  Parseval  equation  (57)  can  be  generalized  by  considering  any 
two  comparison  functions  u(x)  and  v(x)  such  that  the  completeness  relation 
(57)  holds  for  u  and  v  separately,  and  also  for  (u  +  v).  Then  inserting 
f  =  (u  +  v)  in  (57),  and  letting  a^  and  b^  denote  the  Fourier  coefficients 
(defined  by  (54))of  u  and  v,  respectively,  Eq.  (57)  yields 


1 
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.Lj  <»i + v2 


b 

=  f  (u  +  v)  A  (u  +  v)  dx 
'a 

=  (u,  Au)  +  (v,  Av)  +  2  (u,  Avl 


oo  oo 

Observing,  from  (57),  that  V  a .  =  (u,  Au)  and  7  b .  =  (v,  Av),  it 

1=1  1=1 

follows  that 

oo 

Ja  b.  =  (u,  Ay)  (58) 

i^  1  1 

Eq.  (58)  may  be  regarded  as  the  *  generalised"  Parseval  equation. 


23.  Completeness  of  the  eigenfunctions  in  the  norms  of  N  and 
of  M.  Consider  cow  the  eigenvalue  problem  (1)  -  (3),  with  M  and  N 
self-adjoint  and  positive  definite.  Further  condi ticns  on  f  (x)  and  g  (x), 
such  as  assumptions  (44), are  assumed  which  are  sufficient  for  the  exist¬ 
ence  of  the  minima  of  the  Rayleigh  quotient  with  respect  to  comparison 
functions.  Then  it  will  oe  shown  first  that,  in  the  norm  of  the  operator 
N,  the  orthonormalized  eigenfunctions  y.(x)  (satisfying  (5* a)  with  A  =  N; 
cf.  Eqs.  (8a))  form  a  complete  set  for  comparison  functions  f(x),  i.e. 


oo 


J  c7  =  J  f  N(f)  dx 
i=l  1  a 


(59) 


where,  in  accordance  with  (54), 


=  J  f(x)  N(y.,)  dx 


(60) 


To  prove  Eq.  (59),  let 


rjx)  =  f (x)  '  ^  ci 


(61a) 


where  is  given  by  Eq,  (60).  Then  r^(x)  is  a  comparison  function,  and 


moreover, 


(i  =  1,  •  •  •  >  p) 


(61b) 


/  r  N(y  )  dx  =  0 
J  p  1 


Thus  r^  is  orthogonal  to  the  first  p  eigenfunctions.  Consequently,  from 

the  recursive  minimum  characterization  of  X  ,  , 

P+1 


®  (r  )/ «|»  (r  )  =  R'r  )>  X 
P  P  P  P+1 

From  M(y^)  -  X.  N(y.),  Eq.  (61b)  implies 


(61c) 


/  r  M(y£)  dx  =  0 


(i  =  1,  • . . ,  p) 


(6ld) 


Putting  f  *  r  +  1  c.  y.(x),  and  using  (8b).  (10)  and  (6ld),  one  readily 

P  1  1 


finds 


b  P 

*  (f)  =  /  f  M(f)  dx  =  #  (r  )  +  £  cf  X. 


(61e) 


Since  (f)  is  fixed  and  bounded,  and  $  (f)  >  0,  #  (r  )  >  0,  X.  >  0  it  follows 

P  ^ 

from  (61e)  that  $  (r  )  is  bounded  as  p  —  oo.  But  from  (6lc)  i|*(r  )  <  $  (r  )A  . 

P  P  ~  P  P+l 


Since  V^-  xas  p-*oo,  this  then  implies 


lim  vJj  (r  )  =  0 
P 

p  — ■  oo  ^ 


Eq.  (62)  is  identical  to  (53b),  with  A  replaced  by  N.  Thus  the  completeness, 
in  the  norm  of  N,  of  the  set  y^(x)  for  comparison  functions  f  is  proven. 

Eq.  (59)  then  follows  from  Parseval' s  equality  (57)  with  A  =  N. 

From  Eq.  (61e),  since  5  (r^)  >_  0  ,  it  follows  that 
QO  2  <  b 

V  c  J  X.-  J  f  M(f)  dx  (63) 

i=l  a 

oo 

Inequality  (63)  shows  that  ^  c.  X  .  is  convergent. 

i=  1  11 

The  above  proof  can  be  modified  by  using  the  energy  quotient  R  , 
and  the  associated  functions  $  and  41  in  place  of  R,  $  and  (cf.  Section  13). 
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The  completeness  relation  then  becomes 
co 

X  A  =  *  (f)  (64a) 

1=1  1 

where  now 

Cj  =  (f.  Yj)  (64b) 

Moreover,  (64a,  b)  now  hold  for  admissible  functions  f  which  need  only 
satisfy  the  geometric  boundary  conditions,  and  have  a  continuous  (m-l;th, 
and  piecewise  continuous  m*  th,  derivative  in  [a,  b]. 

The  completeness  relations  derived  above  can  be  extended  to  a 
wider  class  of  functions  f  than  described  there.  For  this  purpose,  it.  is 
first  noted, as  observed  in  Refs.  4  and  6,  that  for  the  special  case  of 
N(y)  =  gQ(x)  y,  the  set  of  comparison  functions,  to  be  denoted  by  f,  for 
the  problem  (1)  -  (3)  is  "dense*  in  the  space  of  square -integrable  functions 
f,  that  is:  given  such  an  f  and  any  e  >  0,  there  exists  an  f  such  that 

|  |£  -  T|  |N  <  «  (65a) 

For  more  general  N(y)  as  defined  by  Eq.  (3)  one  may  consider  the  Dirichlet 

form  (cf.  Eq.  (28)),  obtained  by  integrations  by  parts,  of  |  |f  -  f  I  L T  = 
b  _  N 

J  (f  -  f )  N  (f  -  f )  dx  =  4j  (f  -  f ).  This  will  indicate  that  the  denseness 
a 

property  (65a)  cf  the  set  of  comparison  functions  f  will  hold  if  f  is  now  a 
function  whose  N-norm  exists,  and  which  satisfies  enough  of  the  boundary 
conditions  (2)  so  that  the  boundary  terms  in  the  Dirichlet  form  of  ip  (f  -  f ) 
can  be  made  zero  or  as  small  as  desired.  The  derivatives  involved  here 
are  of  order  (2n-l)  and  less,  and  examination  of  i}»  (f  -  f)  will  indicate  that 
the  above  requirement  v,n  the  boundary  terms  can  be  satisfied  by  prescrib¬ 
ing  2n  or  fewer  boundary  conditions  for  f.  By  the  use  of  Eq.  (65a)  for  such 
functions  f,  it  can  be  shown,  following  the  type  of  proof  in  Refs.  4  or  6, 
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that  the  eigenfunctions  of  (1)  -  (3)  will  be  complete  in  the  norm  of  N  for 
this  more  general  class  of  functions.  This  implies  that  Eq.  (59)  will  hold 
for  such  f;  moreover,  Eq.  (53)  will  also  hold,  with  A  =  N,  and  u  and  v 
any  two  functions  of  this  class  f. 

With  the  use  of  the  observations  in  the  preceding  paragraph,  it  can 
now  be  shown  that  the  orthonormalized  eigenfunctions  of  (1)  -  (3)  will 

be  a  complete  set,  in  the  norm  of  M,  for  comparison  functions  u.  This 

« 

can  be  proven  by  applying  the  generalized  Parseval  Eq.  (58)  with  A  =  N, 
u  =  any  given  comparison  function,  and  v  a  function  such  that 

N(v)  =  M(u)  (65b) 


and  satisfying  sufficient  boundary  conditions  (as  explained  in  the  preceding 
paragraph)  so  that  a  comparison  function  f  will  always  exist  such  that 
(65a)  will  hold  with  f  =  v,  and  so  that  N  will  be  self-adjoint  for  such  func¬ 
tions  v.  Then,  in  Eq.  (58), 


b. 

l 


a.  =  J  u  N(Yi)  dx  -  c. 
a 

b  b  b 

=  J  v  N(y.)  dx  =  J  y^  N(v)  dx  =  J  y^  M(u)  dx 
J  a  1  J  a  a 

b  b 

*  J  u  M(yj)  dx  =  J  u\.N(y.)dx=  c{ 


(65c) 


(65d) 


Eq.  (58),  in  conjunction  with  Eqs.  (65b)  -  (65d),  then  yields 


oo 


X  xi  =  /  * 

1=1  a 


M(u)  dx 


(66) 


Eq.  (66)  is  the  same  as  Eq,  (63),  with  the  inequality  sign  now  replaced 
by  an  equality.  Eq.  (66)  signifies  the  completeness  of  the  eigenfunctions, 
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in  the  norm  of  M,  for  comparison  functions  u  . 

24.  Closedness  theorems.  An  intex'esting  and  instructive  alter¬ 
native  method  of  proving  the  completeness  of  the  eigenfunctions,  both  in 
the  norm  of  N  and  of  M.  will  now  be  shown.  For  this  purpose,  the  concept 
of  closedness  of  a  set  of  orthonormal  functions  $  .  will  be  introduced. 

sjcafc 

This,  as  will  be  seen,  is  intimately  related  to  completeness  .  A  set 
of  orthonormal  functions  <p  .  (x)  will  be  called  closed  with  respect  to  func¬ 
tions  f  of  a  given  class  if,  under  a  given  positive  definite  operator  A 
(assumed  also  to  be  linear  and  self-adjoint),  there  is  no  normalized  function 
f  which  is  orthogonal  to  every  ®  ^  in  the  set.  One  might  expect  intuitively 
that  closedness  and  completeness  of  a  set  are  equivalent.  This,  indeed, 
is  the  case  for  functions  f  for  which  (f,  Af)  exists  (in  the  generalized, 
Lebesgue  sense).  To  prove  that  completeness  implies  closedness  is  com¬ 
paratively  simple.  For,  suppose  a  set  is  complete,  but  not  closed.  Then 
there  would  exist  a  normalized  function  f  such  that  c.  =  (f,  A  $  .)  =  0  for  all  i. 

p  11 

Hence  lim  |  |f  -  T  c.  <p  .  |  |  =  |  |f[  |  =  (f,  Af)  =  1,  which  contradicts 

p  — -  oo  x=l  1  1  A  A 

the  supposition  that  the  set  <p  .  is  complete.  The  converse,  namely  that 
closedness  implies  completeness,  can  be  proven  by  using  the  well-known 
Fischer-Riesz  theorem  of  functional  analyte  '  This  theorem  states  that 


*  Actually,  as  seen  in  this  derivation,  it  is  sufficient  that  u  satisfy 
enough  of  the  boundary  conditions  in  (2)  so  that  (65a)  will  hold,  and 
b  b 

so  that  J  y^  M(u)  dx  =  J  u  M(y^)  dx. 

cl  S 

**  In  fact,  the  definitions  of  "completeness"  and  "closedness"  of  a  set 
of  functions  are  sometimes  interchanged  in  the  literature. 
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every  Cauchy  sequence  f  ,  of  functions  f  for  which  the  A-norm  (*,  Af) 
exists,  converges  in  the  norm  of  A  to  such  a  function.  A  proof  of  this 
theorem  may  be  found,  e.g. ,  in  Refs.  7  and  17  for  A  *  I,  the  identity 
operator.  The  extension  in  the  form  stated  above,  to  a  general  positive 
definite,  self-adjoint  operator  A,  satisfying  also  (53d),  is  proven  by 

g 

MiKilin  (pp.  100-109).  With  this  theorem,  the  statement  that  closedness 
implies  completeness  can  be  proven  as  follows.  Suppose  the  set  of  ortho¬ 
normal  functions  cp  .  is  not  complete.  Then  by  (56)  there  will  be  a  function 


f(x)  for  which 


(f,  Af) 


■Si  < 


(67a) 


where  c.  is  given  by  (54).  Consider  now  the  sequence  of  functions 

gP  =  f  ‘  2j  ci  <P  i-  Then  I  le„  -  8.  1 12  cg— 0  as  p,  q— CD  ,  since 

“>  2  U1  P  A  Uptl  1 

)  c.  is  convergent.  Hence  g  is  a  Cauchy  sequence,  and  by  the  Fischer- 
i^i  1  p 

Riesz  theorem  must  therefore  conveige,  in  the  norm  of  A,  to  a  function 
g(x)  whose  A-norm  exists.  This  function  will  be  orthogonal  to  all  of  the 
cp  ,  F or,  (g,  A  cp  )  a  (g  -  g  A  <p  )  +  (g  ,  A  cp . )  for  any  p.  But  for 

A  A  P  1  p  1 

P  >  (gp.  A  <P  4)  s  Cj  -  c.  S  o.  .’ .  (g,  A  <p  .)  s  (g  -  gp,  A  cp  j)  for  all 

p  >  i.  Moreover,  given  any  c  >  0,  there  will  be  a  P  such  that  for  p  >  P, 

(g  -  g  ,  A  cp  .)  <  €  .  Since  (g,  A  cp  ^)  is  independent  of  p,  it  then  follows 

that  (g,  A  cp  .)  =  0.  It  will  now  be  shown,  further,  that  f  [g  }  [  >0.  For, 

1  A 

noting  that  g  =  +  g^,  where  rjp  =  g  ■  g^,  it  is  seen  that 


I  |g  1 1  =  I  In  I  r  +  2  (n  ,  a  g  )  +  I  [g  i  I2 

A  p  A  P  P  Pa 


(67b) 


i.  e. ,  a  sequence  in  wh'ch  |  |f  -  f  [  f  —  0  as  p,  q  —  oo. 

P  ^  A 


This  holds  for  all  p;  but  for  sufficiently  large  p  the  fir3t  three  terms  will 

2  P 

be  arbitrarily  small  in  absolute  value,  while  |  Jg  |  |  -  tt  nf\  V  2 

P  .A  =  **  ***'  “  J_j  ci  • 

In  view  of  (67a),  it  therefore  follows  that  for  sufficiently  large  p,  the 

right  side  of  (67b)  will  be  positive.  Hence  |  |g  |  |  >0.  Thus  g  is  a 

A 

normalizable  function  which  is  orthogonal  to  all  the  $  Hence  the  $  ^ 
cannot  be  closed,  if  they  are  not  complete. 

Suppose  now  that  the  system  (1)  -  (3)  is  self-adjoint,  with  M  positive 
semidefinite  and  N  positive  definite,  and  that  the  system  has  a  discrete 
spectrum.  Then  it  will  be  shown  that  the  eigenfunctions  of  the  problem 
I x)  -  (3)  are  closed,  under  the  operator  N,  for  comparison  functions  f.  To 
prove  this  it  is  noted  that  \  is  the  minimum  value  of  R(u)  with  respect  to 
comparison  functions  u  for  which  i]>  (u)  =  1  and 

(u,  Nyj)  =  0,  (u,  Ny2)=  0,  ...,  (u,  Ny  j)  =  0  (67c) 

Thus,  X^>  ^  . . .  >,  Xp  Suppose  the  set  of  eigenfunctions  y.  were 

not  closed.  Then  there  would  exist  normalized  comparison  functions  u 
which  satisfy  (67c)  for  all  the  y^,  i  =  1,  2,  . . .  .  For  such  functions  u, 

R(u)  must  exist,  i.  e,  be  finite.  Moreover,  since  R(u)  >  0,  R(u)  must 
have  a  greatest  lower  bound,  say  X  .  But  then  X  must  be  at  least  as 

large  as  each  of  the  X^,  i  =  1,  2 .  This,  however,  is  impossible, 

since  X.  -*  oo  as  i— -  co.  Hence  the  system  of  y^  must  be  closed.  Note 
that  this  proof  does  not  use  any  completeness  theorems,  and  may  hence  be 
considered  as  an  alternative  proof  of  completeness. 

In  a  similar  fashion,  it  can  be  shown  that  the  system  of  eigenfunc¬ 
tions  y^  is  closed  not  only  under  the  operator  N,  but  also  under  M,  when 
M  is  positive -definite.  To  prove  this,  it  is  first  recalled  that  the  y^  are 
orthogonal  with  the  operator  M,  i.  e.,  (y^,  M(y. )  )  =  0,  i  ^  j  (Eq.  8b). 
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It  is  then  noted  that  the  eigenvalue  X^  can  be  characterized  as  follows! 

X  is  the  minimum  value  of  <£>  (u)  with  respect  to  comparison  functions  u 
for  which  v|>  (u)  =  1  and  (u,  M(y^)  )  =  0,  i  =  1,  2, . . , ,  p  -  1,  This  can  be 
shown  in  a  manner  quite  similar  to  that  already  shown  in  Section  9  for  the 
corresponding  theorem  involving  N  (instead  of  M);  In  the  equation  im¬ 
mediately  before  Eq.  (27),  the  (6u,  y^)  are  replaced  by  $  (  6u,  y.)> 

1  =  1,  . . . ,  k  -  1.  The  proof  then  proceeds  as  in  that  section,  with  the 
observation  that  N(y^)  =  (1/ X  M(y.),  and  with  the  result  that  $  (y.)  = 

2  [1  -  (i<k/X.)]  $  (u^,  y.).  But  now  $  (u^,  y^)  =  0.  Hence,  since  $(y.)>0, 
k.  =  0  and  the  proof  then  continues  exactly  as  in  Section  9.  With  this 
characterization  of  the  eigenvalues,  the  proof  that  the  eigenfunctions  are 
closed,  under  the  operator  M,  is  then  the  same  as  that  in  the  preceding 
paragraph.  From  this  one  may  now  infer  that  the  eigenfunctions  Y^,  or- 
thonormalized  with  the  operator  M,  form  a  complete  set  for  comparison 


functions  f.  Note  that  if  4<  (y^)  =  1*  then 


Y.(x)  =  y.(x)  X  . 


-1/2 


(68a) 


Moreover,  in  the  series  £  c!  Y^x)  to  represent  f  now,  the  values  of  the 
c'.  according  to  Eq.  (54),  in  conjunction  with  Eq.  (60)  for  c^,  will  be 

.  b  b 

c .  =  f  f(x)M(Y.)dx  =  f  f(x)  X.  N(Y.)  dx 


b  -1/2  1/2 
=  J  f(x)  Xt  N(y.)  Xt  dx  =  X  .  c. 


(68b) 


a 

v  » 


Hence  the  series  £  c.,  Y^x)  becomes  £c.  y.(x),  as  in  the  case  of  the 
operator  N,  and  the  completeness  of  the  functions  Y^(x)  in  the  M-norm 
signifies  that  $  (r^)  —  0  as  p  —  oo,  where  r^  is  still  defined  by  Eq.  (61a). 
Eq.  (61e)  then  yields  the  M-norm  completeness  relation  (66),  This  may 
be  considered  as  an  alternative  proof  of  Eq.  (66)  to  that  given  in  Section 


23. 


-63- 


25.  Expansion  theorems,  Eqs.  (59)  and  (66)  are  the  chief  results 
obtained  in  this  Chapter,  and,  as  will  be  seen  in  a  second  report,  play 
an  important  role  in  the  theory  of  energy  methods.  It  is  noted,  for  ex¬ 
ample,  that  Eq.  (66)  leads  to  the  same  result  that  wou'.d  be  obtained  if 

oo 

one  first  formally  writes  f(x)  =  V  c.  y.(x),  operates  termwise  with  M 

i=  1  1  1 

(which  involves  2m  differentiations)  on  both  sides,  multiplier  both  sides 
by  f  and  then  iutegrater  termwise  over  [a,  b].  Eq.  (66)  justifies  mathe¬ 
matically  the  final  result  of  such  an  operation,  at  least  for  any  comparison 
function  f.  It  should  be  observed,  however,  that  in  spite  of  the  validity 
of  the  final  results  of  such  an  operation,  it  has  not  yet  been  established 
here  that  the  equattor.  «*,  -  jj  c,  ysC  itee«  actuaUy  hold,  aaifo^y 
for  all  x  in  [a,  b].  Such  a  statement  would  be  an  "  expansion  theorem"  . 
Expansion  theorems  are  developed  in  Refs.  4  and  12  for  Sturm -Liov  ville 
(second-order)  equations,  and  in  Refs.  1,  6  and  18  for  more  general  sys¬ 
tems  of  the  type  (1)  -  (3). 

As  will  be  seen  in  a  subsequent  report,  it  is  actually  the  complete¬ 
ness  relations  of  the  type  (59)  and  (66)  which  will  suffice  for  the  further 
developments  to  be  given  there  of  the  theory  of  energy  methods.  The 
following  remarks  on  expansion  theorems  may  nevertheless  be  made  here. 
Considering  first  A  =  I,  the  identity  operator,  in  the  definition  (53b)  of 

completeness,  it  is  noted  that  convergence  in  the  mean  signifies  that 
oo 

lixn  [f(x)  -  /  c.  .  (x)l  will  be  zero  "  almost  everywhere  "  (in  the 
,  i  ■  l 

p—  oo  1=1 

Lebesgue  sense)  in  [a,  b],  i.  e. ,  the  limit  will  be  zero  at  all  x  in  [a,  b] 
with  the  possible  exception  of  a  set  of  points  (of  ■  zero  measure"  )  which 
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wculd  not  contribute  to  the  integral  in  (53b) 

For  A  ~  I.  sufficient  conditions  for  the  uniform  convergence  of 

N  c.-  .  fx)  to  f(x)  in  [a,  bl  would  be:  f(x»  continuous,  all  t  .(x)  continuous, 

-  ^  11  *  1 
1=1  oo 

and  '  c.  .  (x)  uniformly  convergent  in  [a,  b], 
i=*l  1  1 

Consider  now  the  self-adjoint  system  (1)  -  (3)  with  M  and  N  positive 
definite  such  that  the  eigenvalue  spectrum  is  discrete,  and  such  that  the 
system 

M  (u)  =  A  u  (69) 

with  the  same  boundary  conditions  will  also  have  a  discrete  positive  soect- 
rum.  Then  M  will  be  a  strictly  positive  operator,  i.  e.,  M  satisfies  (53d) 
(with  A  =  M,  y  =  Aj  >  0,  the  lowest  eigenvalue  of  (69)).  -  rom  the  com¬ 
pleteness,  in  the  norm  of  M,  of  the  eigenti  nctions  y.  of  (1)  -  (3)  for  com¬ 
parison  functions  f(x),  established  in  Sections  23  and  24,  it  then  follows 
that  the  y.  will  also  be  complete  in  the  sense  of  ordinary  convergence  in 
the  mean  (i.  e.  (53b)  will  hold  with  A  =  I,  r  .  -  y.).  Conseouently,  the 

CD  it 

following  statement  holds:  If  Y  c.  v.(x)  is  uniformly  convergent  in 

i^l  1  1 

(a,  bj,  where  the  y.  are  the  or th ©normalized  eigenfunctions  of  (1)  -  (3)  with 

1  CD 

the  operator  N,  and  the  c.  are  given  by  (60),  then  T  c.  y.(x)  will  converge 

i  iri  1  1 

uniformly  to  f(x)  for  all  x  in  [a-b]„  It  is  shown  in  Ref.  1  (pp.  144-145)  that 
oo  ,  . 

actually  c.  y.  (x)  ,  0  <  v  <  m  -  1,  converges  uniformly  (and  absolutely) 
i*l  11  op 

in  [a,  b],  Consequently,  the  series  ;  c.  y.(x)  will  converge  uniformly  to 

i*l 


T  As  a  simple  example,  with  a  sequence  (instead  of  a  series),  let 
f(x)  =  0  in  [0,  1],  and  consider  the  sequence  f  (x)  =  (1  +  px)"  V  ’ 

in  [0,1],  p  =  1.2,  ...  .  Then  ]  ff  -  £  !  J2  =  ]  fo  -  f  f  \Z  = 

r  r 

1 

f  (1  f  px)’  dx  =  (1/  p)  log  (1  +  p).  Hence  asp—*  CD,  ]  |0  -  f  I  -*  C. 
'  0  P 
Thus  the  sequence  f  (x)  converges  in  the  mean  to  f(x)  =  0.  The  se¬ 
quence  f  (x)  itself,  however,  is  readily  seen  to  converge  to  zero  at 
all  x  in  fo,  1]  except  at  x  =  0,  where  f  =  1  for  all  p  (and  hence 

f  —  1).  N'ote  that  the  sequence  f  (x)  duos  not  converge  uniformly 
P  P 

in  (0,  1  ]  to  zero. 


1 
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f(x)  for  all  x  in  [a,  b],  and  the  series  may,  in  fact,  be  differentiated  term- 
wise  (m  -  1)  times. 
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